expression of a gene of interest in a plant comprising introducing into
the plant a first nucleotide sequence comprising the gene of interest
operatively linked to a first regulatory region, and an operator sequence
capable of binding a fusion protein, and a second nucleotide sequence
comprising a second regulatory region in operative association with a
nucleotide sequence encoding the fusion protein. The fusion protein
comprises a DNA binding protein, or a portion thereof, capable of binding
the operator sequence, and a recruitment factor protein, or a portion
thereof, capable of binding a chromatin remodelling protein. In this
manner, expression of the second nucleotide sequence produces the fusion
protein that regulates expression of the gene of interest.

Regulation of Gene Expression Using Chromatin Remodelling Factors

The present invention relates to the regulation of gene expression. More
particularly, the present invention relates to the control of gene expression of one or
more nucleotide sequences of interest in transgenic plants using chromatin
remodelling factors.



BACKGROUND OF THE INVENTION

Transgenic plants have been an integral component of advances made in
agricultural biotechnology. They are necessary tools for the production of plants
exhibiting desirable traits (e.g. herbicide and insect resistance, drought and cold
tolerance), or producing products of nutritional or pharmaceutical importance. As the
applications of transgenic plants become ever more sophisticated, it is becoming
increasingly necessary to develop strategies to fine-tune the expression of introduced
genes. The ability to tightly regulate the expression of transgenes is important to
address many safety, regulatory and practical issues. To this end, it is necessary to
develop tools and strategies to regulate the expression of transgenes in a predictable
manner.

Several strategies have so far been employed to control plant gene/transgene
expression. These include the use of regulated promoters, such as inducible or
developmental promoters, whereby the expression of genes of interest is driven by
promoters responsive to various regulatory factors (Gate, 1997). Other strategies
involve co-suppression (Eisner et al., 1998) or anti-sense technology (Kohno-Murase
et al., 1994), whereby plants are transformed with genes, or fragments thereof, that are
homologous to genes, in either the sense or the antisense orientation. Chimeric
RNA-DNA oligonucleotides have also been used to block the expression of target genes in
plants (Beetham et al., 1999; Zhu et al., 1999).

Posttranslational modifications of histones in chromatin are important
mechanisms in the regulation of gene expression. Protein-protein interactions
between histones H3, H4, H2A and H2B form an octameric core which is wrapped
with DNA. N-terminal tails of histones protrude from the octamer and are subject to
posttranslational modification involving acetylation and deacetylation of conserved
lysine residues. A nucleosome comprises 26 lysine residues that may be subject to
acetylation. Acetylation of core histones, including H4 and H3, via histone
acetyltransferase (HAT) is correlated with transcriptionally active chromatin of
eukaryotic cells. Acetylation is thought to weaken the interactions of histones with
DNA and induce alterations in nucleosome structure. These alterations enhance the
accessibility of promoters to components of the transcription machinery, and increase
transcription. HATs have been identified in yeast, insects, plants and mammals (e.g.
Kolle et al., 1998), and are typically components of multiprotein complexes including
components of RNA polymerase II complex, TFIID, TFIIC and recruitment factors
(e.g. see Lusser et al., 2001 for review).

Histone deacetylation, via histone deacetylase (HD, HDA, HDAC), is thought
to lead to a less accessible chromatin conformation, resulting in the repression of
transcription (e.g. Pazin and Kadonaga, 1997; Struhl, 1998; Lusser et al., 2001). The
role of the yeast histone deacetylase, RPD3, in transcriptional repression was first
discovered through a genetic screen for transcriptional repressors in S. cerevisiae
(Vidal and Gaber, 1991). Since then, a number of yeast and mammalian HDAC genes
have been cloned (Rundlett et al., 1996; Emiliani et al., 1998; Hassig et al., 1998;
Verdel and Khochbin, 1999). Most eukaryotic histone deacetylases show some
sequence homology to yeast RPD3, suggesting that these proteins are all members
derived from a single gene family (Khochbin and Wolffe, 1997; Verdel and
Khochbin, 1999). In yeast and mammalian cells, the RPD3/HDACs mediate
transcriptional repression by interacting with specific DNA-binding proteins or
associated corepressors and by recruitment to target promoters (Kadosh and Struhl,
1997; Hassig et al., 1997; Nagy et al., 1997; Gelmetti et al., 1998). Recently, a second
family of histone deacetylases, HDA19 and related proteins, was identified in yeast
and mammalian cells (Rundlett et al., 1996; Fischle et al., 1999; Verdel and
Khochbin, 1999). The deacetylase domain of HDA19-related proteins is homologous
to but significantly different from that of RPD3 (Fischle et al., 1999; Verdel and
Khochbin, 1999). These proteins also appear to be functionally different from
RPD3-like proteins in yeast cells (Rundlett et al., 1996). WO 97/35990 discloses
mammalian-derived histone deacetylase (HDx) gene sequences, gene products, and
uses for these sequences and products. The down regulation of gene expression in
plants using histone deacetylase, fused to a DNA binding domain that targeted the
fusion protein to a specific gene, has been demonstrated (Wu et al., 2000a; Wu et al.,
2000b).

The present invention embraces the use of fusion proteins comprising a DNA
binding domain fused to a recruitment factor that is capable of recruiting chromatin
remodelling proteins, such as HDAC and HAT, to specific DNA sites to regulate
expression of a gene of interest. Also disclosed is the use of fusion proteins
comprising a DNA binding portion fused to histone acetyltransferase (HAT) to
regulate transcription of a gene of interest.



It is an object of the invention to overcome disadvantages of the prior art. 

The above object is met by the combinations of features of the main claims;
the sub-claims disclose further advantageous embodiments of the invention.

SUMMARY OF THE INVENTION 



The present invention relates to the regulation of gene expression. More
particularly, the present invention relates to the control of gene expression of one or
more nucleotide sequences of interest in transgenic plants using chromatin
remodelling factors.

According to an aspect of an embodiment of the present invention, there is
provided a method to regulate the expression of a gene of interest in a plant
comprising:

i) introducing to the plant:

1) a first nucleotide sequence comprising,

a) the gene of interest operatively linked to a first regulatory region,

b) an operator sequence capable of binding a fusion protein, and;

2) a second nucleotide sequence comprising a second regulatory region in
operative association with a nucleotide sequence encoding a fusion protein,
the fusion protein comprising,

a) a DNA binding protein, or a portion of a DNA binding protein
capable of binding the operator sequence, and;

b) a recruitment factor protein, or a portion thereof, capable of binding
a chromatin remodelling protein,

ii) growing the plant, wherein expression of the second nucleotide sequence
produces the fusion protein and regulates expression of the gene of interest.

The present invention also embraces the methods as defined above, wherein 
the first and second regulatory regions are either the same or different and are selected 
from the group consisting of a constitutive promoter, an inducible promoter, a tissue 
specific promoter, and a developmental promoter. 



The present invention also relates to a method of enhancing the expression of
a gene of interest or enhancing the transcription of a gene of interest in a plant
comprising:

i) introducing to the plant:

1) a first nucleotide sequence comprising,

a) the gene of interest operatively linked to a first regulatory region,
and;

b) an operator sequence that interacts with a fusion protein;

2) a second nucleotide sequence comprising a second regulatory region in
operative association with a nucleotide sequence encoding a fusion protein
comprising,

a) a DNA binding protein, or a portion thereof, capable of binding the
operator sequence, and;

b) a histone acetyltransferase (HAT) protein, or portion thereof,
capable of increasing histone acetylation;

ii) growing the plant, wherein expression of the second nucleotide sequence
produces the fusion protein and increases transcription of the gene of interest.

The present invention pertains to a method of regulating the expression of a
gene of interest or enhancing the transcription of a gene of interest in a plant
comprising:

i) introducing to the plant:

1) a first nucleotide sequence comprising,

a) the gene of interest operatively linked to a first regulatory region,
and;

b) an operator sequence that interacts with a fusion protein;

2) a second nucleotide sequence comprising a second regulatory region in
operative association with a nucleotide sequence encoding a fusion protein
comprising,

a) a DNA binding protein, or a portion thereof, capable of binding the
operator sequence, and;

b) a chromatin remodelling factor, or portion thereof, capable of
increasing histone acetylation;

ii) growing the plant, wherein expression of the second nucleotide sequence
produces the fusion protein and regulates the transcription of the gene of interest.

The present invention also embraces the methods as defined above, wherein 
the first and second regulatory regions are either the same or different and are selected 
from the group consisting of a constitutive promoter, an inducible promoter, a tissue 
specific promoter, and a developmental promoter. 

The first and second nucleotide sequences may be placed within the same or
within different vectors, genetic constructs, or nucleic acid molecules. Preferably, the
first nucleotide sequence and the second nucleotide sequence are chromosomally
integrated into a plant or plant cell. The two nucleotide sequences may be integrated
into two different genetic loci of a plant or plant cell, or the two nucleotide sequences
may be integrated into a single genetic locus of a plant or plant cell. However, the
second nucleotide sequence may be integrated into the DNA of the plant or it may be
present as an extra-chromosomal element, for example, but not wishing to be limiting,
a plasmid.

Also, according to the present invention there is provided a method for
selectively controlling the transcription of a gene of interest, comprising:

i) producing a first plant comprising a first genetic construct, the first genetic
construct comprising a first regulatory region operatively linked to the gene of
interest and at least one operator sequence capable of binding a fusion protein;

ii) producing a second plant comprising a second genetic construct, the second
genetic construct comprising a second regulatory region in operative
association with a nucleic acid sequence encoding the fusion protein, the fusion
protein comprising,

a) a DNA binding protein, or a portion thereof, capable of binding the
operator sequence, and;

b) a recruitment factor protein, or a portion thereof, capable of binding
a chromatin remodelling protein;

iii) crossing the first plant and the second plant to obtain progeny comprising both
the first genetic construct and the second genetic construct, the progeny
characterized in that the expression of the fusion protein regulates expression
of the gene of interest.

The present invention also embraces the methods as defined above, wherein
the first and second regulatory regions are either the same or different and are selected
from the group consisting of a constitutive promoter, an inducible promoter, a tissue
specific promoter, and a developmental promoter.

The present invention also pertains to the method as just defined, wherein the
nucleic acid sequence encoding the fusion protein is optimised for expression in a
plant, and wherein the nucleotide sequence encodes a nuclear localization signal.



Also, according to the present invention there is provided a method for
selectively controlling the transcription of a gene of interest, comprising:

i) producing a first plant comprising a first genetic construct, the first genetic
construct comprising a first regulatory region operatively linked to the gene of
interest and at least one operator sequence capable of binding a fusion protein;

ii) producing a second plant comprising a second genetic construct, the second
genetic construct comprising a second regulatory region in operative
association with a nucleic acid sequence encoding the fusion protein comprising,

a) a DNA binding protein, or a portion thereof, capable of binding the
operator sequence, and;

b) a HAT protein, or portion thereof, capable of histone acetylation in
plants;

iii) crossing the first plant and the second plant to obtain progeny comprising both
the first genetic construct and the second genetic construct and characterized
in that the expression of the fusion protein up-regulates the expression of the
gene of interest.

The present invention also provides the method as just defined, wherein the
nucleic acid sequence encoding the fusion protein is optimised for expression in the
plant, and wherein the nucleic acid sequence encodes a nuclear localization signal.

The present invention also embraces the methods as defined above, wherein
the first and second regulatory regions are either the same or different and are selected
from the group consisting of a constitutive promoter, an inducible promoter, a tissue
specific promoter, and a developmental promoter.

Furthermore, this invention provides a method to regulate expression of an
endogenous nucleic acid sequence of interest in a plant comprising:

i) introducing into the plant a nucleotide sequence comprising a regulatory region,
operatively linked with a nucleotide sequence encoding a fusion protein, the fusion
protein comprising,

a) a DNA binding protein, or a portion thereof, capable of binding a segment
of a DNA sequence of the endogenous nucleotide sequence of interest;

b) a recruitment factor protein, or a portion thereof, capable of binding a
chromatin remodelling protein; and

ii) growing the plant, wherein expression of the nucleotide sequence produces the
fusion protein that regulates expression of the endogenous nucleic acid sequence of
interest.

The present invention also includes a method to regulate expression of an
endogenous nucleic acid sequence of interest in a plant comprising:

i) introducing into the plant a nucleotide sequence comprising a regulatory region,
operatively linked with a nucleotide sequence encoding a recruitment factor protein,
the recruitment factor protein capable of binding an endogenous DNA binding
protein, the endogenous DNA binding protein characterized in binding a segment of a
DNA sequence of the endogenous nucleotide sequence of interest, and;

ii) growing the plant, wherein expression of the nucleotide sequence produces the
recruitment factor thereby regulating expression of the endogenous nucleic acid
sequence of interest.

This summary of the invention does not necessarily describe all necessary
features of the invention; the invention may also reside in a sub-combination
of the described features.

BRIEF DESCRIPTION OF THE DRAWINGS

These and other features of the invention will become more apparent from the 
following description in which reference is made to the appended drawings wherein: 

FIGURE 1 shows the nucleotide and deduced amino acid sequences of wild type
ROS and a modified ROS of Agrobacterium tumefaciens. Figure 1(A) shows
the amino acid sequence alignment of known ROS repressors (wild-type ROS,
SEQ ID NO:1; ROSR, SEQ ID NO:63; ROSAR, SEQ ID NO:64; MucR,
SEQ ID NO:65), and a synthetic ROS (SEQ ID NO:4). The amino acid
sequence 'PKKKRKV' (SEQ ID NO:6) at the carboxy end of synthetic ROS
is one of several nuclear localization signals. Figure 1(B) shows the
nucleotide sequence of a synthetic ROS (SEQ ID NO:2) that has been
optimised for plant codon usage and contains a nuclear localization signal
peptide (in italics). Optional restriction sites at the 5' end of the sequence are
underlined. Figure 1(C) shows the consensus nucleotide (SEQ ID NO:3) and
predicted amino acid (SEQ ID NO:4) sequence of a composite ROS sequence
comprising all possible nucleotide sequences that encode wild type ROS
repressor, and the wild type ROS amino acid sequence. The amino acid
sequence 'PKKKRKV' (SEQ ID NO:6) at the carboxy end represents a
nuclear localization signal. Amino acids in bold identify the zinc finger motif.
Nucleotide codes are as follows: N= A or C or T or G; R= A or G; Y= C or T;
M= A or C; K= T or G; S= C or G; W= A or T; H= A or T or C; B= T or C or
G; D= A or T or G; V= A or C or G. Figure 1(D) shows the nucleotide
sequence of the operator sequences of the virC/virD (SEQ ID NO:27) and ipt
(SEQ ID NO:8) genes. Figure 1(E) shows a consensus operator sequence
(SEQ ID NO:5) derived from the virC/virD (SEQ ID NOs:66-67) and ipt
(SEQ ID NOs:68-69) operator sequences. This sequence comprises 10 amino
acids, however, only the first 9 amino acids are required for binding ROS.
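
The degenerate nucleotide codes listed in the Figure 1(C) legend can be read as a lookup table. The following is a minimal sketch in Python, not part of the disclosed method, of how a concrete sequence could be checked against a degenerate consensus written with those codes. The function names and the example codon "GCN" are hypothetical and are not taken from SEQ ID NO:3.

```python
# Degenerate nucleotide codes exactly as listed in the Figure 1(C) legend,
# plus the four unambiguous bases.
IUPAC = {
    "A": "A", "C": "C", "G": "G", "T": "T",
    "N": "ACTG", "R": "AG", "Y": "CT", "M": "AC", "K": "TG",
    "S": "CG", "W": "AT", "H": "ATC", "B": "TCG", "D": "ATG", "V": "ACG",
}


def matches_consensus(sequence: str, consensus: str) -> bool:
    """Return True if each base of `sequence` is permitted by `consensus`."""
    if len(sequence) != len(consensus):
        return False
    return all(
        base in IUPAC[code]
        for base, code in zip(sequence.upper(), consensus.upper())
    )


def count_encoded_sequences(consensus: str) -> int:
    """Count the concrete sequences represented by a degenerate consensus."""
    total = 1
    for code in consensus.upper():
        total *= len(IUPAC[code])
    return total


# Hypothetical example: a codon with a fully degenerate third position.
print(matches_consensus("GCT", "GCN"))
print(count_encoded_sequences("GCN"))
```

A composite sequence of this kind grows multiplicatively with each degenerate position, which is why a single consensus can stand for every nucleotide sequence encoding the same protein.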

FIGURES 2-4 show in diagrammatic form several variations of regulating gene
expression using the methods of the present invention.



FIGURE 5 shows schematic representations of nucleotide constructs that place the
expression of a gene of interest under the control of a regulatory region, in this
case a CaMV35S regulatory region, modified to contain a ROS operator site. Figure
5(A) shows the nucleotide construct p74-315 in which a CaMV35S regulatory
region, modified to contain a ROS operator site downstream of the TATA box,
is operatively linked to a gene of interest (β-glucuronidase; GUS). Figure
5(B) shows the nucleotide construct p74-316 in which a CaMV35S regulatory
region, modified to contain a ROS operator site upstream of the TATA box,
is operatively linked to the protein encoding region of GUS. Figure 5(C)
shows the nucleotide construct p74-309 in which a CaMV35S regulatory
region modified to contain ROS operator sites upstream and downstream of
the TATA box is transcriptionally fused (i.e. operatively linked) to the protein
encoding region of GUS. Figure 5(D) shows construct p74-118 comprising a
35S regulatory region with three ROS operator sites downstream from the
TATA box. The 35S regulatory region is operatively linked to the gene of
interest (GUS).

FIGURE 6 shows a schematic representation of a nucleotide construct that places the
expression of a gene of interest under the control of a regulatory region,
in this case, the tms2 regulatory region that has been modified to contain ROS
operator sites. Figure 6(A) shows the nucleotide construct p76-507 in which a
tms2 regulatory region is operatively linked to a gene of interest (in this case
encoding β-glucuronidase, GUS). Figure 6(B) shows the nucleotide construct
p76-508 in which a tms2 regulatory region modified to contain two tandemly
repeated ROS operator sites downstream of the TATA box is transcriptionally
fused (i.e. operatively linked) to the protein coding region of GUS.

FIGURE 7 shows a schematic representation of a nucleotide construct that places the
expression of a gene of interest under the control of a regulatory region, in this
case the actin2 regulatory region, that has been modified to contain ROS operator
sites. Figure 7(A) shows the nucleotide construct p75-101 in which an actin2
regulatory region is operatively linked to a gene of interest (the
β-glucuronidase (GUS) reporter gene). Figure 7(B) shows the nucleotide
construct p74-501 in which an actin2 regulatory region modified to contain
two tandemly repeated ROS operator sites upstream of the TATA box is
transcriptionally fused (operatively linked) to a gene of interest (GUS).

FIGURE 8 shows Southern analysis of transgenic Arabidopsis plants. Figure 8(A)
shows Southern analysis of a plant comprising a first genetic construct,
p74-309 (35S-operator sequence-GUS; see Figure 5(C) for map). DNA was
digested with ClaI or XhoI and the blot was probed with the ORF of the GUS
gene. Figure 8(B) shows Southern analysis of a plant comprising a second
genetic construct, p75-101 (see Figure 7(A)). HindIII digests were probed with
NPTII.

FIGURE 9 shows expression of a gene of interest in plants. Upper panel shows
expression of GUS under the control of 35S (pBI121; 35S:GUS). Middle
panel shows GUS expression under the control of actin2 comprising ROS
operator sequences (p74-501; see Figure 7(B) for construct). Lower panel
shows the lack of GUS activity in a non-transformed control.

FIGURE 10 shows alignments of bnKCP1 and sequence comparison of kinase
inducible domains (KIDs) in bnKCP1 and CREB family members. Figure
10(A) shows alignment of the deduced amino acid sequences of bnKCP1
(SEQ ID NO:71), atKCP (SEQ ID NO:72), atKCL1 (SEQ ID NO:73) and
atKCL2 (SEQ ID NO:74) proteins. Serine (S)-rich residues and the conserved
region (GKSKS domain) among the four sequences are single underlined and
double underlined, respectively. The putative nuclear localization signal
(NLS) and the phosphorylation site of protein kinase A are indicated by
asterisks and diamonds, respectively. Figure 10(B) shows alignment of the
amino acid sequences of bnKCP1 (SEQ ID NO:75), hydra CREB (hyCREB)
(SEQ ID NO:77), canfa CREM (cCREM) (SEQ ID NO:80), and mammalian
ATF-1 (SEQ ID NO:76), CREB (SEQ ID NO:78) and CREM (SEQ ID
NO:79). Diamonds indicate the conserved phosphorylation site of protein
kinase A. Figure 10(C) shows a phylogenetic tree of the KID sequences
using the NTI Vector program.



FIGURE 11 shows structural features of bnKCP1. Figure 11(A) shows a schematic
representation of the entire bnKCP1 protein. Numbers above or under the boxes
refer to positions of amino acid residues. S-rich (34-58), GKSKS (88-143) and
KID (161-215) domains or motifs are shown in dotted boxes, the nuclear localization
signal (NLS) in a black box, and the three acidic motifs (I, II, III) in gray boxes.
Figure 11(B) shows secondary structure features and hydrophilicity of
bnKCP1 analyzed using the DNAstar Protean program.

FIGURE 12 shows Southern blot analysis of Brassica genomic DNA. Total genomic
DNA (10 µg/lane) from Brassica napus cv Westar was digested with
restriction enzymes EcoRI (EI), XbaI (X), HindIII (H), PstI (P), EcoRV (EV)
and KpnI (K). The entire ORF of bnKCP1 was used as a probe.

FIGURE 13 shows in vitro interaction of wild type and mutant bnKCP1 proteins with
the GST-HDA19 and GST-Gcn5 fusion proteins. Figure 13(A) shows a
schematic representation of bnKCP1 and its deletion mutants obtained by
deletion of C-terminal regions of bnKCP1. Figure 13(B) shows binding
activities of bnKCP1 and its mutants with GST-HDA19, GST-Gcn5 and GST
alone (negative control), respectively, as indicated. The wild type bnKCP1,
mutants bnKCP1¹⁻¹⁶⁰ and bnKCP1¹⁻⁸⁰, luciferase (as positive control) and
negative control (no template) were produced using in vitro
transcription/translation reactions. The translation products were incubated
with GST fusion proteins or GST and their binding activities were examined
as described in Example 4. Figure 13(C) shows activation of the lacZ reporter
gene by bnKCP1 and its deletion mutants, ΔbnKCP1¹⁻¹⁶⁰ and ΔbnKCP1¹⁻⁸⁰, in
yeast cells. MaV203 yeast cells carrying plasmid pDBLeu-HDA19 and the
reporter gene were transfected with the plasmid pPC86-bnKCP1,
pPC86-bnKCP1¹⁻¹⁶⁰, pPC86-bnKCP1¹⁻⁸⁰ or pPC86 vector only. Yeast strains A and B
were used as negative and positive controls, respectively. The β-galactosidase
activity was assayed using chlorophenol red-β-D-galactopyranoside (CPRG)
and was expressed as a percentage of activity conveyed by bnKCP1.
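
As a reading aid for the deletion mutants, the sketch below lists which of the bnKCP1 domains given in the Figure 11(A) legend fall entirely within a given fragment. It is an illustration only: the domain coordinates are those stated in the legend, the fragments are those named in the Figure 13 and Figure 17 legends, and the function name and the "fully contained" criterion are assumptions made here rather than anything defined in the specification.

```python
# Domain coordinates (first, last residue) as stated in the Figure 11(A) legend.
DOMAINS = {
    "S-rich": (34, 58),
    "GKSKS": (88, 143),
    "KID": (161, 215),
}


def retained_domains(start: int, end: int) -> list[str]:
    """Return the domains lying entirely within residues start..end (inclusive)."""
    return [
        name
        for name, (first, last) in DOMAINS.items()
        if start <= first and last <= end
    ]


# Fragments named in the Figure 13 and Figure 17 legends.
for label, (start, end) in {"1-160": (1, 160), "1-80": (1, 80), "81-215": (81, 215)}.items():
    print(label, retained_domains(start, end))
```

Under this criterion, neither C-terminal deletion (1-160 or 1-80) retains the KID domain, whereas the 81-215 fragment does.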

FIGURE 14 shows the effect of S188 on the interaction between bnKCP1 and the
GST-HDA19 fusion protein. A glycine residue (G188) was introduced by
site-directed mutagenesis to replace S188. The binding activities of wild-type
bnKCP1 and the mutant ΔbnKCP1G188 with GST-HDA19 or GST alone
(negative control) were examined with a GST pulldown affinity assay as
described in Example 4. Figure 14(A) shows the introduction of G188 into the KID of
bnKCP1. Figure 14(B) shows in vitro protein interaction of bnKCP1 and the
mutant ΔbnKCP1G188 with GST-HDA19 or GST alone.

FIGURE 15 shows expression patterns of bnKCP1 mRNA in different tissues. Total
RNA (20 µg/lane) was isolated from leaves with petioles, flowers, roots, stems
and immature siliques.

FIGURE 16 shows expression of the bnKCP1 gene in response to low temperature,
LaCl3 and ionomycin treatments. Total RNA (20 µg/lane) was isolated from
leaf blades of four-leaf stage Brassica napus cv Westar seedlings after
exposure to different stress conditions and analyzed by northern blotting using
the bnKCP1 ORF as probe. Figure 16(A) shows bnKCP1 transcript
accumulation in leaves and stems of seedlings exposed to cold (4°C). Figure
16(B) shows the expression pattern of the bnKCP1 gene after treatment with LaCl3
and ionomycin.

FIGURE 17 shows transactivation of the lacZ gene by bnKCP1 in yeast. The lacZ
gene was driven by a promoter containing GAL4 DNA binding sites and
integrated into the genome of yeast MaV203. Figure 17(A) is a schematic
representation of bnKCP1 and its deletion mutants. Figure 17(B): yeast
cells carrying the reporter gene were transfected with the effector plasmids
pDBLeu-bnKCP1, pDBLeu-bnKCP1¹⁻¹⁶⁰, pDBLeu-bnKCP1¹⁻⁸⁰, and
pDBLeu-bnKCP1⁸¹⁻²¹⁵ or the pDBLeu vector only. Yeast strains A and B (GIBCO
BRL, Life Technologies) were used as negative and positive controls,
respectively. The β-galactosidase activity was assayed using CPRG
(chlorophenol red-β-D-galactopyranoside) and was expressed as a percentage
of activity conveyed by the positive control (strain C). Bars indicate the
standard error of three replicates.
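
The reporter read-out described in this legend (activity expressed as a percentage of a reference, with the standard error of three replicates) can be sketched as follows. This is a minimal illustration under stated assumptions, not the procedure of Example 4: the function name is hypothetical, the numbers in the usage line are invented placeholders rather than data from the figures, and the calculation simplifies by normalising each replicate to the mean of the reference without propagating the reference's own error.

```python
from math import sqrt
from statistics import mean, stdev


def percent_of_control(sample: list[float], control: list[float]) -> tuple[float, float]:
    """Express replicate activities as a percentage of the mean control activity.

    Returns (mean percentage, standard error of the mean percentage).
    Simplifying assumption: variation in the control replicates is ignored.
    """
    control_mean = mean(control)
    percents = [100.0 * value / control_mean for value in sample]
    standard_error = stdev(percents) / sqrt(len(percents))
    return mean(percents), standard_error


# Placeholder values for three replicates (not data from the specification).
mean_pct, sem_pct = percent_of_control([2.0, 4.0, 6.0], [4.0, 4.0, 4.0])
print(f"{mean_pct:.1f}% +/- {sem_pct:.1f}")
```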

FIGURE 18 shows the nuclear localization of GUS-bnKCP1 protein in onion cells.
Figure 18(A) is a schematic diagram of the GUS-bnKCP1 fusion construct
containing the CaMV 35S promoter. The bnKCP1 was fused in-frame to the
GUS reporter gene. Figure 18(B) shows transient expression of the
GUS-bnKCP1 fusion protein (top) and GUS alone (bottom) in onion cells. Onion tissues
were simultaneously analysed using a histochemical GUS assay (left) and
nucleus-specific staining with DAPI (right) as described in Example 4.



FIGURE 19 shows a diagrammatic representation of a strategy for preparing fusions
between a recruitment factor involved in chromatin remodelling and a DNA
binding protein. In the non-limiting example shown in this figure, the
recruitment factor is KID (see Example 4), and the DNA binding protein is a
zinc finger.

FIGURE 20 shows alignment of the deduced products of BnSCL1 (SEQ ID NO:81),
AtSCL15 (accession number Z99708) (SEQ ID NO:82) and LsSCL (accession
number AF273333) (SEQ ID NO:83). Identical and conserved amino acids in the
three proteins are shown as white letters on a black background and black letters on a
gray background, respectively. Amino acids with weak similarity are indicated as
white letters on a gray background. Amino acids with no similarity are shown as
black letters on a white background. The putative nuclear localization signals and
LXXLL motif are indicated by asterisks and dots, respectively. The VHIID motif,
two leucine heptad regions (LHRI and LHRII), and the PFYRE and SAW motifs are
underlined as indicated.

FIGURE 21 shows a phylogenetic tree of the GRAS family sequences made by the NTI
Vector program in Brassica napus, Arabidopsis thaliana, Hordeum vulgare, Zea
mays, Lycopersicon esculentum, Pisum sativum and Oryza sativa. The BnSCL1 is
underlined.

FIGURE 22 shows DNA gel blot analysis of the BnSCL1 gene. Total genomic DNA (10
µg/lane) from Brassica napus was digested with restriction enzymes EcoRI (EI),
XbaI (X), HindIII (H), PstI (P), EcoRV (EV) and KpnI (K), and hybridized with the
entire ORF of BnSCL1 under high stringency conditions.

FIGURE 23 shows in vitro interaction of wild type and mutant BnSCL1 proteins with the
GST-HDA19 fusion protein. Figure 23(A) is a schematic representation of
BnSCL1 and its deletion mutants obtained by the deletion of its C-terminal regions.
Figure 23(B) shows the binding activities of BnSCL1 and its mutants with GST-HDA19.
The wild type BnSCL1, mutants ΔBnSCL1¹⁻³⁵⁸, ΔBnSCL1¹⁻²⁶¹, ΔBnSCL1¹⁻²¹⁷ and
ΔBnSCL1¹⁻¹⁴⁵, luciferase (positive control) and negative control (no template) were
produced using in vitro transcription/translation reactions. The translation products
were incubated with GST fusion proteins or GST alone (data not shown) and their
binding activities were examined as described in Example 5. The arrow points to the band
representing the in vitro translated ΔBnSCL1¹⁻¹⁴⁵ protein that did not bind to the
recombinant protein.

FIGURE 24 shows in vivo interaction of wild type and mutant BnSCL1 proteins. Figure
24(A) is a schematic representation of BnSCL1 and its deletion mutants. Figure
24(B) shows the activation of the lacZ reporter gene by BnSCL1 and its deletion
mutants in yeast cells. MaV203 yeast cells carrying plasmid pDBLeu-HDA19 and
the lacZ reporter gene were transfected with the plasmid pPC86-BnSCL1,
pPC86-BnSCL1¹⁻³⁵⁸, pPC86-BnSCL1¹⁻²⁶¹, pPC86-BnSCL1¹⁻²¹⁷, pPC86-BnSCL1¹⁻¹⁴⁵,
pPC86-BnSCL1¹⁴⁶⁻³⁵⁸, pPC86-BnSCL1²¹⁸⁻⁴³⁸ or pPC86 vector only. The negative
control yeast strain A and the positive control yeast strains B and C (GIBCO
BRL, Life Technologies) were also used. The β-galactosidase activity was assayed
using CPRG (chlorophenol red-β-D-galactopyranoside) and was expressed as a
percentage of activity conveyed by yeast strain C. Bars indicate the standard error of
three replicates.

FIGURE 25 shows transactivation of the lacZ gene by BnSCL1 protein in yeast. Figure
25(A) is a schematic representation of BnSCL1 and its deletion mutants. Figure
25(B) shows the activation of the lacZ reporter gene by BnSCL1 and its deletion
mutants in yeast cells. The lacZ reporter gene was driven by a promoter containing
GAL4 DNA binding sites and integrated into the genome of yeast MaV203 cells.
Yeast cells carrying the reporter gene were transfected with the effector plasmids
pDBLeu-BnSCL1, pDBLeu-BnSCL1¹⁻³⁵⁸, pDBLeu-BnSCL1¹⁻²⁶¹,
pDBLeu-BnSCL1¹⁻²¹⁷, pDBLeu-BnSCL1¹⁻¹⁴⁵, pDBLeu-BnSCL1¹⁴⁶⁻³⁵⁸,
pDBLeu-BnSCL1²¹⁸⁻⁴³⁸ or pDBLeu vector only. Yeast strains A, B, C and D (GIBCO BRL, Life
Technologies) were used as controls as described in Example 5. The
β-galactosidase activity was assayed using CPRG and was expressed as a percentage
of activity conveyed by the wild type BnSCL1 protein. Bars indicate the standard error of
three replicates.

FIGURE 26 shows expression patterns of BnSCL1 mRNA in different tissues. Figure
26(A) is an RNA gel blot analysis of total RNA (20 µg/lane) isolated from leaves,
flowers, roots, stems and siliques, electrophoresed through a 1.2% agarose gel
containing formaldehyde and probed with the ORF of BnSCL1 as described in
Example 5. EtBr-stained total RNA is shown to indicate even loading. Figure 26(B)
is a quantitative one-step RT-PCR analysis of total RNA extracted from leaves,
flowers, roots, stems, siliques and shoots. Quantitative RT-PCR products were
electrophoresed through a 1% agarose gel and hybridized with a 32P-labelled 5'-end
fragment (435 bp) of the BnSCL1 ORF. A 960 bp fragment of the Brassica napus actin
gene co-amplified with BnSCL1 was used as an internal standard as described in
Example 5.

FIGURE 27 shows expression of the BnSCL1 gene in four-leaf stage Brassica napus
seedlings in the presence or absence of 2,4-D. Total RNA was isolated from the
fourth leaves after the indicated period of the first foliar application of 1 mM 2,4-D
and subjected to quantitative one-step RT-PCR. The RT-PCR products were
analyzed by Southern blotting using the BnSCL1 ORF as probe (left) and the
blotting results are shown graphically relative to the level of the internal standard
Actin (arbitrary value of 100) (right).

FIGURE 28 shows kinetics of BnSCL1 mRNA accumulation in response to auxin in the
presence and absence of the histone deacetylase inhibitor sodium butyrate. Nine-day-old
light-grown seedlings were treated with 10 mM sodium butyrate for 24 h
followed by exogenous 2,4-D application at variable concentrations as indicated.
Quantitative one-step RT-PCR was used to analyze total RNA extracted from shoots
(Figure 28A) and roots (Figure 28B) (see legend to Figure 27). Expression of
BnSCL1 in response to 2,4-D was also analyzed using quantitative RT-PCR of total
RNA isolated from shoots and roots of 10 dpg seedlings in the presence of 50 µM
NPA, an auxin transport inhibitor, for 24 h before the exogenous application of
2,4-D (Figure 28C).

FIGURE 29 shows in a diagrammatic form several constructs that may be used to
regulate gene expression as described in Example 6.

The present invention relates to the regulation of gene expression. More
particularly, the present invention relates to the control of gene expression of one or
more nucleotide sequences of interest in transgenic plants using chromatin
remodelling factors.

The following description is of a preferred embodiment by way of example
only and without limitation to the combination of features necessary for carrying the
invention into effect.

Gene regulation can be used in applications such as metabolic engineering to
produce plants that accumulate large amounts of certain intermediate compounds.
Regulation of gene expression can also be used for control of transgenes across
generations, or production of F1 hybrid plants with seed characteristics that would be
undesirable in the parental line, for example but not limited to, hyper-high oil,
reduced fiber content, low glucosinolate levels, reduced levels of phytotoxins, and the
like. In the latter examples, low levels of glucosinolates, or other phytotoxins, may be
desired in seeds while higher concentrations of these compounds may be required
elsewhere, for example in the case of glucosinolates, within cotyledons, due to their
role in plant defence. Another non-limiting example for the controlled regulation of a
gene of interest during plant development is seed specific down regulation of sinapine
biosynthesis, as for example in seeds of Brassica napus. In many instances, transgene
expression needs to be regulated only in certain plant organs/tissues or at certain
stages of development. The methods as described herein may also be used to control
the expression of a gene of interest that encodes a protein used for plant selection
purposes. For example, which is to be considered non-limiting, a gene of interest may
encode a protein that is capable of metabolizing a compound from a non-toxic form to
a toxic form thereby selectively removing plants that express the gene of interest.

The present invention provides a method to regulate the expression of a gene
of interest by transforming a plant with one or more constructs comprising:
1) a first nucleotide sequence comprising,
   a) a nucleic acid sequence of interest operatively linked to a
      regulatory region,
   b) an operator sequence capable of binding a fusion protein, and;
2) a second nucleotide sequence comprising a regulatory region in operative
   association with a nucleotide sequence encoding a fusion protein, the fusion
   protein comprising,
   a) a DNA binding protein, or a portion of a DNA binding protein
      capable of binding the operator sequence, and;
   b) a recruitment factor protein, or a portion of a recruitment factor
      protein capable of binding a chromatin remodelling protein,

wherein binding of the fusion protein to the operator sequence of the first nucleotide 
sequence regulates expression of the nucleic acid sequence of interest from the first 
nucleotide sequence. The operator sequence of the first nucleotide sequence may be 
positioned upstream of the ORF of the nucleic acid sequence of interest. 

These first and second nucleotide sequences may be placed within the same or 
within different vectors, genetic constructs, or nucleic acid molecules. Preferably, the 
first nucleotide sequence and the second nucleotide sequence are chromosomally 
integrated into a plant or plant cell. The two nucleotide sequences may be integrated
into two different genetic loci of a plant or plant cell, or the two nucleotide sequences
may be integrated into a single genetic locus of a plant or plant cell. However, the
second nucleotide sequence may be integrated into the DNA of the plant or it may be
present as an extra-chromosomal element, for example, but not wishing to be limiting,
a plasmid.

By "operatively linked" it is meant that the particular sequences interact 
either directly or indirectly to carry out their intended function, such as mediation or 
modulation of gene expression. The interaction of operatively linked sequences may, 
for example, be mediated by proteins that in turn interact with the sequences. A 
transcriptional regulatory region and a sequence of interest are "operably linked" 
when the sequences are functionally connected so as to permit transcription of the 
sequence of interest to be mediated or modulated by the transcriptional regulatory 
region.

By the term "regulate the expression" it is meant reducing or increasing the
level of mRNA, protein, or both mRNA and protein, encoded by a gene or nucleotide
sequence of interest in the presence of the fusion protein encoded by the second
nucleotide sequence, relative to the level of mRNA, protein or both mRNA and
protein encoded by the nucleic acid sequence of interest in the absence of the fusion
protein encoded by the second nucleotide sequence.
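
The comparison in this definition (level with the fusion protein relative to level without it) can be illustrated with a short sketch. The function name, the 10% tolerance band and the example levels are assumptions made for illustration only; the specification does not set a numerical threshold.

```python
def classify_regulation(level_with_fusion, level_without_fusion, tolerance=0.1):
    """Classify an mRNA or protein level relative to the no-fusion-protein baseline.

    `tolerance` is a hypothetical band around a ratio of 1.0 within which
    the level is treated as unchanged.
    """
    if level_without_fusion <= 0:
        raise ValueError("baseline level must be positive")
    ratio = level_with_fusion / level_without_fusion
    if ratio > 1 + tolerance:
        return "increased", ratio
    if ratio < 1 - tolerance:
        return "reduced", ratio
    return "unchanged", ratio


# Hypothetical relative transcript levels (arbitrary units).
print(classify_regulation(20.0, 100.0))   # level falls in the presence of the fusion protein
print(classify_regulation(300.0, 100.0))  # level rises in the presence of the fusion protein
```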

By the term "fusion protein" it is meant a protein comprising two or more
amino acid portions which are not normally found together within the same protein in
nature and that are encoded by a single gene. Fusion proteins may be prepared by
standard techniques in molecular biology known to those skilled in the art (see for
example Figure 17). In the context of the present invention, at least one of the amino
acid portions is capable of binding an operator sequence as defined herein.

By the term "binding" it is meant reversible or non-reversible association of
two components, for example the operator sequence and the DNA binding domain of
a protein, including a fusion protein, or the recruitment factor protein and chromatin
remodelling protein as described herein. Preferably, the two components have a
tendency to remain associated, but are capable of dissociation under appropriate
conditions. Conditions may include, but are not limited to, the addition of a third
component, chemical, etc., which enhances dissociation of the bound components.

By the term "recruitment factor" it is meant a protein or peptide sequence
capable of interacting with, or binding a chromatin remodelling protein. Preferably,
the recruitment factor and the chromatin remodelling protein interact or bind in a
manner such that the activity of the chromatin remodelling protein is retained.
However, by binding the recruitment factor, the activity of the chromatin remodelling
protein may be modified in some manner. Non-limiting examples of recruitment
factors include KID, for example bnKCP1, or fragments thereof (Example 4),
BnSCL1, or fragments thereof (Example 5), ADA, SAGA, STAGA, PCAF, TFIID,
and TFIIIC (Lusser, 2001, Table 1, which is incorporated herein by reference). A
recruitment factor may be modified to include a DNA binding region, for example as
outlined in Figure 17, [or a native recruitment factor may be utilized to target proteins
that interact with genes in their native context]. As an example, which is not to be
considered limiting in any manner, bnKCP1, or active fragments thereof (see
Example 4), can target transcription factors that are known to bind DNA. Examples of
such transcription factors include ERF (Hart et al., 1993), SEBF (Boyle and Brisson,
2001), or CBF (Stockinger et al., 1997). In this manner, by over-expressing bnKCP1,
the expression of a gene that is dependent on ERF, CBF or SEBF
activity may be regulated. Another non-limiting example of a recruitment factor is
BnSCL1, or active fragments thereof (see Example 5). An example, which is not to
be considered limiting, of a protein that interacts with bnKCP1 and BnSCL1 is the
chromatin remodelling protein HDAC, for example HDA19.

By the term "chromatin remodelling protein" it is meant a protein that is 
capable of altering the structure of chromatin. Preferably the chromatin remodelling 
protein is histone acetyl transferase (HAT) or histone deacetylase (referred to either as 
HD, HDA, or HDAC). Any HAT protein, HDAC protein, or any derivative of any 
HAT protein or HDAC protein may be used in the method of the present invention 
provided that the HAT protein, HDAC protein or derivative thereof exhibits the 
respective histone acetylase, or histone deacetylase activity in plants. 

By the term "HD binding domain" or "histone deacetylase binding domain", it
is meant a sequence of amino acid residues which interacts with a histone deacetylase
enzyme through protein-protein interactions. Such protein-protein interactions can be
monitored in several ways, for example, which is not to be considered limiting, by
yeast two-hybrid experiments. Non-limiting examples of proteins comprising an HD
binding domain include bnKCP1 and BnSCL1.

By the term "DNA binding protein or portion of a DNA binding protein" it is
meant a protein or amino acid sequence capable of binding to a specific operator
sequence. By "operator sequence" it is meant a sequence of DNA that is capable of
binding to the DNA binding protein or portion of the DNA binding protein. Examples
of DNA binding proteins capable of binding specific operator sequences include,
but are not limited to, the ROS repressor, TET repressor, Sin3, VP16, GAL4, LexA,
UMe6, ERF, SEBF and CBF. Any DNA binding protein or portion of any DNA
binding protein may be employed in the method of the present invention provided that
the protein or portion thereof is capable of binding to an operator sequence. As an
example, but not to be considered limiting in any manner, the ROS repressor may be
employed in the method of the present invention. By ROS repressor it is meant any
ROS repressor, analog or derivative thereof as known within the art which is capable
of binding to an operator sequence. These include ROS repressors as described herein,
as well as other microbial ROS repressors, for example but not limited to ROSAR
(Agrobacterium radiobacter; Brightwell et al., 1995) (SEQ ID NO:64), MucR
(Rhizobium meliloti; Keller M et al., 1995) (SEQ ID NO:65), and ROSR (Rhizobium
etli; Bittinger et al., 1997; also see Cooley et al., 1991; Chou et al., 1998; Archdeacon
J et al., 2000; D'Souza-Ault M. R., 1993; all of which are incorporated herein by
reference) (SEQ ID NO:63). The DNA sequence of ROS, or any other DNA binding
protein, may be modified to optimize expression within a plant. Examples of ROS
repressors that may be used as described herein are provided in Figures 1(A) to (C)
(SEQ ID NOs: 1-4).

The DNA binding protein, or portion thereof that exhibits DNA binding
activity, may be fused to a recruitment factor or chromatin remodelling protein as
described herein. Examples of such fusion proteins can be prepared using methods
known in the art, for example but not limited to the method outlined in Figure 17.
Figure 17 discloses a strategy for creating a fusion between the zinc finger domain of
the ROS repressor and the KID domain of bnKCP1. This involves amplification of
regions encoding the zinc finger domain of the ROS repressor and the KID domain
using the following primers:

zinc finger: The forward primer (zf-F) contains a restriction enzyme site at the 5' end
             and the reverse primer (zf-R) contains 15 nucleotides from the 5' end of the
             KID region;
KID domain:  The forward primer (KID-F) contains 15 nucleotides from the 3' region
             of the zinc finger domain, and the reverse primer (KID-R) contains a
             restriction enzyme site at the 3' end.

The amplified zinc finger and KID fragments are combined and used as a template for
a new round of PCR amplification where only the forward primer (zf-F) of the zinc
finger and the reverse primer (KID-R) of the KID domain are used. The two separate
templates are amplified to create one single in-frame fusion fragment encoding the
zinc finger and KID domains, and containing restriction sites at each end. This
product is then cloned into a plant expression vector.
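
The two-round amplification strategy of Figure 17 can be sketched in code. This is a minimal illustration under stated assumptions: the template sequences are invented placeholders (not the ROS zinc finger or bnKCP1 KID sequences), the restriction sites (GGATCC and GAGCTC) are arbitrary choices since the text does not name the enzymes, and the 18 nt annealing length is hypothetical. Only the 15 nt overlaps come from the description.

```python
COMPLEMENT = str.maketrans("ACGT", "TGCA")


def revcomp(seq):
    """Reverse complement of an uppercase DNA string."""
    return seq.translate(COMPLEMENT)[::-1]


def design_fusion_primers(zf, kid, site_5p, site_3p, anneal=18, overlap=15):
    """Return the four primers, each written 5'->3'."""
    return {
        # Restriction site at the 5' end, then the start of the zinc finger region.
        "zf-F": site_5p + zf[:anneal],
        # Anneals to the zinc finger 3' end; its tail carries the first 15 nt of KID.
        "zf-R": revcomp(zf[-anneal:] + kid[:overlap]),
        # Tail carries the last 15 nt of the zinc finger, then the start of KID.
        "KID-F": zf[-overlap:] + kid[:anneal],
        # Anneals to the KID 3' end; the site ends up at the 3' end of the product.
        "KID-R": revcomp(kid[-anneal:] + site_3p),
    }


def simulate_fusion(zf, kid, site_5p, site_3p, overlap=15):
    """Top strands of the two first-round products, joined through their overlap."""
    zf_product = site_5p + zf + kid[:overlap]
    kid_product = zf[-overlap:] + kid + site_3p
    shared = zf[-overlap:] + kid[:overlap]  # 30 nt common to both products
    assert zf_product.endswith(shared) and kid_product.startswith(shared)
    return zf_product + kid_product[len(shared):]


# Placeholder templates (hypothetical); lengths are multiples of 3 to stay in frame.
zf = "ATGGCTTGCCCAGAATGTGGTAAACGTTTC"
kid = "GGAAAGAAGCGTTCTGCTTCTGAAGATGAA"
primers = design_fusion_primers(zf, kid, "GGATCC", "GAGCTC")
fused = simulate_fusion(zf, kid, "GGATCC", "GAGCTC")
```

In the second round only zf-F and KID-R are supplied, so the only full-length product is the joined fragment flanked by the two sites.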

However, it is to be understood that fusion of a recruitment factor with a DNA
binding protein may not be required in order to regulate expression of a nucleic acid
sequence of interest. Recruitment factors are known to bind chromatin remodelling
proteins and factors that directly or indirectly bind DNA. For example, bnKCP1
(Example 4) exhibits the property of binding ERF.

Depending upon the chromatin remodelling protein selected, gene expression
may be up-regulated or down-regulated. For example, which is not to be considered
limiting in any manner, the binding of a fusion protein containing a recruitment factor
capable of recruiting HAT to a gene may result in up-regulation of expression of a
nucleic acid sequence of interest, while a fusion protein that recruits HDAC will result
in the down-regulation of the expression of a nucleic acid sequence of interest.
However, it is within the scope of the present invention that modification to the rate of
up-regulation and down-regulation of gene expression may occur depending upon the
location of the operator sequence that binds the fusion protein.
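
The relationship between the fusion protein, the operator and the expected direction of regulation can be summarized in a small data model. This is a sketch, not part of the disclosed method: the class and field names are hypothetical, "GUS" is used only as a placeholder gene of interest, and the mapping records the nominal expectation stated above without modelling the effect of operator position.

```python
from dataclasses import dataclass

# Nominal direction of regulation for each recruited enzyme class, per the text.
EXPECTED_EFFECT = {"HAT": "up-regulation", "HDAC": "down-regulation"}


@dataclass(frozen=True)
class FusionProtein:
    dna_binding_domain: str   # e.g. "ROS"
    recruitment_factor: str   # e.g. "bnKCP1"
    recruits: str             # "HAT" or "HDAC"


@dataclass(frozen=True)
class TargetConstruct:
    gene_of_interest: str
    operator_for: str         # DNA binding domain that recognizes the operator


def predicted_effect(fusion, target):
    """Return the expected effect, or None when the operator is not bound."""
    if fusion.dna_binding_domain != target.operator_for:
        return None
    return EXPECTED_EFFECT[fusion.recruits]


ros_kid = FusionProtein("ROS", "bnKCP1", "HDAC")
reporter = TargetConstruct("GUS", "ROS")          # placeholder gene of interest
print(predicted_effect(ros_kid, reporter))
```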

The operator sequence is preferably located in proximity to the nucleic acid
sequence of interest, either upstream of or downstream of the nucleic acid sequence of
interest (see for example Figure 5 A-D). Alternatively, the operator sequence may be
within the non-coding region of the nucleic acid sequence of interest, for example, but
not wishing to be limiting, within an intron of the gene. If it is desired to have the
expression of a nucleic acid sequence of interest reduced or repressed, the operator
sequence may be located within a nucleotide region that interferes with binding of
transcription factors required for transcription of the nucleic acid sequence of interest,
for example, interfering with the binding of the RNA polymerase to the nucleic acid
sequence of interest, or reducing the rate of migration of the polymerase along a
nucleotide sequence, or both.

An operator sequence may consist of a minimal sequence required for binding
of a DNA binding protein or fragment thereof, or it may comprise an inverted repeat
or palindromic sequences of a specified length. For example, but not wishing to be
limiting, the ROS operator sequence may comprise 9 or more nucleotide base pairs
(see Figures 1(D) and (E)) that exhibit the property of binding a DNA binding
domain of a ROS repressor. A consensus sequence of a 10 nucleotide region
including the 9 nucleotide DNA binding site sequence is WATDHWKMAR (SEQ ID
NO: 5; Figure 1(E)). The last nucleotide, "R", of the consensus sequence is not
required for ROS binding (data not presented). Examples of operator sequences,
which are not to be considered limiting in any manner, also include, as is the case
with the ROS operator sequence from the virC or virD gene promoters, a ROS
operator made up of two 11 bp inverted repeats separated by TTTA:

TATATTTCAATTTTATTGTAATATA (SEQ ID NO: 7); or
the operator sequence of the IPT gene:

TATAATTAAAATATTAACTGTCGCATT (SEQ ID NO: 8).

However, it is to be understood that analogs or variants of SEQ ID NOs: 5, 7 and 8
may also be used providing they exhibit the property of binding a DNA binding
domain, preferably, a DNA binding domain of the ROS repressor. For example, but
not to be considered limiting in any manner, in the promoter of the divergent
virC/virD genes of Agrobacterium tumefaciens, ROS binds to a 9 bp inverted repeat
sequence in an orientation-independent manner (Chou et al., 1998). The ROS
operator sequence in the ipt promoter also consists of a similar sequence to that in the
virC/virD except that it does not form an inverted repeat (Chou et al., 1998). Only the
first 9 bp are homologous to the ROS box in virC/virD, indicating that the second 9 bp
sequence may not be a requisite for ROS binding. Accordingly, the use of ROS
operator sequences or variants thereof that retain the ability to interact with ROS, as
operator sequences to selectively control the expression of genes or nucleotide
sequences of interest, is within the scope of the present invention.
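
As an illustration of how a candidate operator could be checked against the ROS consensus, the sketch below reads SEQ ID NO: 5 as a string of IUPAC nucleotide ambiguity codes (an interpretation, under which W = A/T, D = A/G/T, H = A/C/T, K = G/T, M = A/C, R = A/G) and converts it to a regular expression. The function name is hypothetical; the sequences are those listed above.

```python
import re

# IUPAC nucleotide ambiguity codes needed for this consensus.
IUPAC = {
    "A": "A", "C": "C", "G": "G", "T": "T",
    "W": "[AT]", "R": "[AG]", "K": "[GT]", "M": "[AC]",
    "D": "[AGT]", "H": "[ACT]",
}


def consensus_hits(consensus, sequence):
    """Return 0-based start positions where the consensus matches the sequence."""
    pattern = "".join(IUPAC[base] for base in consensus)
    # A lookahead lets overlapping matches be reported as well.
    return [m.start() for m in re.finditer(f"(?={pattern})", sequence)]


ROS_BOX_9 = "WATDHWKMA"                        # SEQ ID NO: 5 without the final R
VIR_OPERATOR = "TATATTTCAATTTTATTGTAATATA"     # SEQ ID NO: 7
IPT_OPERATOR = "TATAATTAAAATATTAACTGTCGCATT"   # SEQ ID NO: 8

vir_hits = consensus_hits(ROS_BOX_9, VIR_OPERATOR)
ipt_hits = consensus_hits(ROS_BOX_9, IPT_OPERATOR)
```

Under this reading, both listed operators match the 9-position consensus starting at their first nucleotide, which agrees with the statement that only the first 9 bp of the ipt operator are shared with the virC/virD ROS box.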

Other operator sequences include sequences known to bind transcription
factors, for example but not limited to:
    TAAGAGCCGCC (SEQ ID NO:9), which is known to bind ERF (in ethylene
    responsive genes; Hart et al., 1993);
    GACTGTCAC (SEQ ID NO:10), which is known to bind to SEBF (in
    pathogenesis responsive genes; Boyle and Brisson, 2001);
    TACCGACAT (SEQ ID NO: 11) and TGGCCGAC (SEQ ID NO: 12),
    which are known to bind CBF (in low temperature responsive genes;
    Stockinger et al., 1997).

The transcription factors ERF, SEBF and CBF are examples of factors that can be
targeted by the recruitment factor bnKCP1.
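
A promoter can be screened for the operator sequences listed above with a simple exact-match scan of both strands. The sketch below is illustrative only: the function name is hypothetical and the promoter fragment is an invented test string, not a sequence from the specification.

```python
OPERATORS = {
    "TAAGAGCCGCC": "ERF",   # SEQ ID NO: 9
    "GACTGTCAC": "SEBF",    # SEQ ID NO: 10
    "TACCGACAT": "CBF",     # SEQ ID NO: 11
    "TGGCCGAC": "CBF",      # SEQ ID NO: 12
}
COMPLEMENT = str.maketrans("ACGT", "TGCA")


def find_operators(promoter):
    """Return sorted (position, strand, factor) tuples for every operator found."""
    hits = []
    reverse = promoter.translate(COMPLEMENT)[::-1]
    for strand, seq in (("+", promoter), ("-", reverse)):
        for motif, factor in OPERATORS.items():
            start = seq.find(motif)
            while start != -1:
                # Report minus-strand hits in plus-strand coordinates.
                pos = start if strand == "+" else len(promoter) - start - len(motif)
                hits.append((pos, strand, factor))
                start = seq.find(motif, start + 1)
    return sorted(hits)


# Invented fragment: an ERF site on the plus strand and an SEBF site on the minus strand.
promoter = "AATT" + "TAAGAGCCGCC" + "GGTT" + "GTGACAGTC" + "AA"
hits = find_operators(promoter)
```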

By "regulatory region" or "regulatory element" it is meant a portion of nucleic
acid typically, but not always, upstream of the protein coding region of a gene, which
may be comprised of either DNA or RNA, or both DNA and RNA. When a
regulatory region is active and in operative association with a nucleic acid sequence 
of interest, this may result in expression of the nucleic acid sequence of interest. A 
regulatory element may be capable of mediating organ specificity, or controlling 
developmental or temporal gene activation. A "regulatory region" includes promoter 
elements, core promoter elements exhibiting a basal promoter activity, elements that 
are inducible in response to an external stimulus, elements that mediate promoter 
activity such as negative regulatory elements or transcriptional enhancers. 
"Regulatory region", as used herein, also includes elements that are active following 
transcription, for example, regulatory elements that modulate gene expression such as 
translational and transcriptional enhancers, translational and transcriptional 
repressors, upstream activating sequences, and mRNA instability determinants. 
Several of these latter elements may be located proximal to the coding region. 

In the context of this disclosure, the term "regulatory element" or "regulatory 
region" typically refers to a sequence of DNA, usually, but not always, upstream (5') 
to the coding sequence of a structural gene, which controls the expression of the 
coding region by providing the recognition for RNA polymerase and/or other factors 
required for transcription to start at a particular site. However, it is to be understood 
that other nucleotide sequences, located within introns, or 3' of the sequence may also 
contribute to the regulation of expression of a coding region of interest. An example
of a regulatory element that provides for the recognition for RNA polymerase or other
transcriptional factors to ensure initiation at a particular site is a promoter element.
Most, but not all, eukaryotic promoter elements contain a TATA box, a conserved
nucleic acid sequence comprised of adenosine and thymidine nucleotide base pairs
usually situated approximately 25 base pairs upstream of a transcriptional start site. A
promoter element comprises a basal promoter element, responsible for the initiation
of transcription, as well as other regulatory elements (as listed above) that modify
gene expression.

There are several types of regulatory regions, including those that are 
developmentally regulated, inducible or constitutive. A regulatory region that is 
developmentally regulated, or controls the differential expression of a gene under its 
control, is activated within certain organs or tissues of an organ at specific times 
during the development of that organ or tissue. However, while some regulatory regions
that are developmentally regulated may preferentially be active within certain organs
or tissues at specific developmental stages, they may also be active in a
developmentally regulated manner, or at a basal level, in other organs or tissues within
the plant as well.

An inducible regulatory region is one that is capable of directly or indirectly
activating transcription of one or more DNA sequences or genes in response to an
inducer. In the absence of an inducer the DNA sequences or genes will not be
transcribed. Typically the protein factor, which binds specifically to an inducible
regulatory region to activate transcription, may be present in an inactive form which is
then directly or indirectly converted to the active form by the inducer. However, the
protein factor may also be absent. The inducer can be a chemical agent such as a
protein, metabolite, growth regulator, herbicide or phenolic compound or a
physiological stress imposed directly by heat, cold, salt, or toxic elements or
indirectly through the action of a pathogen or disease agent such as a virus. A plant
cell containing an inducible regulatory region may be exposed to an inducer by
externally applying the inducer to the cell or plant such as by spraying, watering,
heating or similar methods. Inducible regulatory elements may be derived from either
plant or non-plant genes (e.g. Gatz, C. and Lenk, I.R.P., 1998; which is incorporated
by reference). Examples of potential inducible promoters include, but are not limited to,
the tetracycline-inducible promoter (Gatz, C., 1997; which is incorporated by reference),
the steroid-inducible promoter (Aoyama, T. and Chua, N.H., 1997; which is incorporated
by reference), the ethanol-inducible promoter (Salter, M.G., et al., 1998; Caddick,
M.X. et al., 1998; which are incorporated by reference), the cytokinin-inducible IB6 and
CKI1 genes (Brandstatter, I. and Kieber, J.J., 1998; Kakimoto, T., 1996; which are
incorporated by reference) and the auxin-inducible element, DR5 (Ulmasov, T., et
al., 1997; which is incorporated by reference).

A constitutive regulatory region directs the expression of a gene throughout
the various parts of a plant and continuously throughout plant development.
Examples of known constitutive regulatory elements include promoters associated
with the CaMV 35S transcript (Odell et al., 1985), the rice actin 1 (Zhang et al.,
1991), actin 2 (An et al., 1996), or tms 2 (U.S. 5,428,147, which is incorporated
herein by reference), and triosephosphate isomerase 1 (Xu et al., 1994) genes, the
maize ubiquitin 1 gene (Cornejo et al., 1993), the Arabidopsis ubiquitin 1 and 6 genes
(Holtorf et al., 1995), and the tobacco translational initiation factor 4A gene (Mandel
et al., 1995). The term "constitutive" as used herein does not necessarily indicate that a
gene under control of the constitutive regulatory region is expressed at the same level
in all cell types, but that the gene is expressed in a wide range of cell types even
though variation in abundance is often observed.

The regulatory regions of the first and second nucleotide sequences denoted
above may be the same or different. For example, which is not to be considered
limiting in any manner, the regulatory elements of the first and second genetic
constructs may both be constitutive. In an aspect of an embodiment, the first and
second nucleotide sequences may be maintained in the same plant. In an alternate
embodiment the first and second nucleotide sequences are maintained in separate
plants, a first and a second plant, respectively. The first nucleotide sequence encoding
a nucleic acid sequence of interest is expressed within the first plant. In the second
embodiment, the second plant expresses the second nucleic acid sequence encoding
the fusion protein capable of regulating the expression of the nucleic acid sequence of
interest within the first plant. Crossing of the first and second plants produces a
progeny that expresses the fusion protein which regulates the expression of the
nucleic acid sequence of interest. In this manner the expression of a nucleic acid
sequence of interest that is required to maintain parent stocks may be retained within a
parent plant but not expressed in a progeny plant. Such a cross may produce sterile
offspring.
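
The outcome of such a cross can be illustrated by enumerating gametes. The sketch below rests on assumptions that the text does not state: each construct is inserted at a single locus, the two loci assort independently, and each parent is homozygous for its own construct. The allele symbols and function names are hypothetical.

```python
from collections import Counter
from itertools import product


def gametes(genotype):
    """All gametes for a genotype given as one allele pair per locus."""
    return list(product(*genotype))


def cross(parent_a, parent_b):
    """Counter of progeny genotypes, each written as sorted allele pairs per locus."""
    progeny = Counter()
    for ga, gb in product(gametes(parent_a), gametes(parent_b)):
        progeny[tuple("".join(sorted(pair)) for pair in zip(ga, gb))] += 1
    return progeny


# Locus 1: T = first nucleotide sequence (gene of interest) present, t = absent.
# Locus 2: F = second nucleotide sequence (fusion protein) present, f = absent.
first_plant = ("TT", "ff")     # carries only the gene-of-interest construct
second_plant = ("tt", "FF")    # carries only the fusion-protein construct
f1 = cross(first_plant, second_plant)
print(dict(f1))
```

Under these assumptions every F1 plant carries one copy of each construct, so the fusion protein and its target are brought together only in the progeny. With hemizygous parents, e.g. `("Tt", "ff")`, only a fraction of the progeny would inherit both.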

Alternatively, which is not to be considered limiting in any manner, the
second regulatory element may be active before, during, or after the first regulatory
element is active. Similarly, the first regulatory element may be active before, during,
or after the second regulatory element is active. Other examples, which are not to be
considered limiting, include the second regulatory element being an inducible
regulatory element that is activated by an external stimulus so that regulation of gene
expression may be controlled through the addition of an inducer. The second
regulatory element may also be active during a specific developmental stage
preceding, during, or following that of the activity of the first regulatory element. In
this way the expression of the nucleic acid sequence of interest may be repressed or
activated as desired within a plant.

By "nucleic acid sequence of interest", "nucleotide sequence of interest" or
"coding region of interest" it is meant any gene or nucleotide sequence that is to be
expressed within a host organism. Such a nucleotide sequence of interest may include,
but is not limited to, a gene whose product has an effect on plant growth or yield, for
example a plant growth regulator such as an auxin or cytokinin and their analogues, or
a nucleotide sequence of interest may comprise a herbicide or a pesticide resistance
gene, which are well known within the art. A nucleic acid sequence of interest or a
coding region of interest may encode an enzyme involved in the synthesis of, or in
the regulation of the synthesis of, a product of interest, for example, but not limited to
a protein, or an oil product. A nucleotide sequence of interest or a coding region of
interest may encode an industrial enzyme, protein supplement, nutraceutical, or a
value-added product for feed, food, or both feed and food use. Examples of such
proteins include, but are not limited to proteases, oxidases, phytases, chitinases,
invertases, lipases, cellulases, xylanases, enzymes involved in oil biosynthesis, etc.

A nucleotide sequence of interest or a coding region of interest may also
encode a pharmaceutically active protein, for example growth factors, growth
regulators, antibodies, antigens, their derivatives useful for immunization or
vaccination and the like. Such proteins include, but are not limited to, interleukins,
insulin, G-CSF, GM-CSF, hPG-CSF, M-CSF or combinations thereof, interferons, for
example, interferon-α, interferon-β, interferon-γ, blood clotting factors, for example,
Factor VIII, Factor IX, or tPA or combinations thereof. If the nucleic acid sequence
of interest or a coding region of interest encodes a product that is directly or
indirectly toxic to the plant, then by using the method of the present invention, such
toxicity may be reduced throughout the plant by selectively expressing the nucleic
acid sequence of interest within a desired tissue or at a desired stage of plant
development.

A nucleotide sequence of interest or a coding region of interest may also
include a gene that encodes a protein involved in regulation of transcription, for
example DNA-binding proteins that act as enhancers or basal transcription factors.
Moreover, a nucleotide sequence of interest may be comprised of a partial sequence
or a chimeric sequence of any of the above genes, in a sense or antisense orientation.

It is also contemplated that a nucleic acid sequence of interest or a coding
region of interest may be involved in the expression of a gene expression cascade, for
example but not limited to a developmental cascade. In this embodiment, the nucleic
acid sequence of interest is preferably associated with a gene that is involved at an
early stage within the gene cascade, for example homeotic genes. Expression of a
nucleic acid sequence of interest, for example a repressor of homeotic gene
expression, represses the expression of a homeotic gene. Expression of the fusion
protein that represses gene expression within the same plant, either via crossing,
induction, temporal or developmental expression of the regulatory region, as
described herein, de-represses the expression of the homeotic gene thereby initiating a
gene cascade. Conversely, using the methods described herein, expression of an
introduced (i.e. transgenic) homeotic gene may be activated in a selective manner, so
that it is expressed outside of its normal developmental or temporal expression
pattern, thereby initiating a cascade of developmental events. This may be achieved
by targeting a chromatin remodelling protein to a desired homeotic gene as described
herein.

Homeotic genes are well known to one of skill in the art, and include but are
not limited to, transcription factor proteins and associated regulatory regions, for
example controlling sequences that bind AP2 domain containing transcription factors,
for example but not limited to, APETALA2 (a regulator of meristem identity, floral
organ specification, seedcoat development and floral homeotic gene expression;
Jofuku et al., 1994), CCAAT box-binding transcription factors (e.g. LEC1; WO
98/37184; Lotan, T. et al., 1998), or the controlling factor associated with PICKLE, a
gene that produces a thickened, primary root meristem (Ogas, J. et al., 1997).

A nucleic acid sequence of interest or a coding region of interest, may also be
involved in the control of transgenes across generations, or production of F1 hybrid
plants with seed characteristics that would be undesirable in the parental line or
progeny, for example but not limited to, oil seeds characterized as having reduced
levels of sinapine biosynthesis within the oil-free meal. In this case, a nucleic acid
sequence of interest may encode any enzyme involved in the synthesis of one or more
intermediates in sinapine biosynthesis. An example, which is to be considered non-limiting,
is caffeic o-methyltransferase (Acc# AAG51676), which is involved in
ferulic acid biosynthesis. Other examples of genes of interest include genes that
encode proteins involved in fiber, or glucosinolate, biosynthesis, or a protein involved
in the biosynthesis of a phytotoxin. Phytotoxins may also be used for plant selection
purposes. In this non-limiting example, a nucleic acid sequence of interest may
encode a protein that is capable of metabolizing a compound from a non-toxic form to
a toxic form thereby selectively removing plants that express the nucleic acid
sequence of interest. The phytotoxic compound may be synthesized from endogenous
precursors that are metabolized by the product of the nucleic acid sequence of interest
into a toxic form, for example plant growth regulators, or the phytotoxic compound may be
synthesized from an exogenously applied compound that is only metabolized into a
toxic compound in the presence of the nucleic acid sequence of interest. For example,
which is not to be considered limiting, the nucleic acid sequence of interest may
comprise indole acetamide hydrolase (IAH), that converts exogenously applied indole
acetamide (IAM) or naphthalene acetamide (NAM), to indole acetic acid (IAA), or
naphthalene acetic acid (NAA), respectively. Over-synthesis of IAA or NAA is toxic
to a plant; however, in the absence of IAH, the applied IAM or NAM is non-toxic.
Similarly, the nucleic acid sequence of interest may encode a protein involved in
herbicide resistance, for example, but not limited to, phosphinothricin acetyl
transferase, wherein, in the absence of the gene encoding the transferase, application
of phosphinothricin, the toxic compound (herbicide), results in plant death. Other
nucleic acid sequences of interest that encode lethal or conditionally lethal products
may be found in WO 00/37660 (which is incorporated herein by reference).
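
The conditionally lethal selection schemes described above reduce to simple truth tables. The following is a minimal sketch in Python, offered for illustration only; the function names are hypothetical and the model is purely qualitative (it ignores dose, timing and tissue specificity, which the text does not quantify).

```python
def survives_iam_treatment(expresses_iah: bool, iam_applied: bool) -> bool:
    """IAH converts applied IAM (or NAM) to toxic levels of IAA (or NAA).

    The plant is lost only when BOTH the hydrolase and its substrate are present.
    """
    return not (expresses_iah and iam_applied)


def survives_phosphinothricin(expresses_transferase: bool, herbicide_applied: bool) -> bool:
    """Phosphinothricin acetyl transferase protects against the herbicide.

    The plant is lost only when the herbicide is applied and the transferase is absent.
    """
    return expresses_transferase or not herbicide_applied


if __name__ == "__main__":
    for iah in (False, True):
        for iam in (False, True):
            print(f"IAH={iah!s:5} IAM applied={iam!s:5} survives={survives_iam_treatment(iah, iam)}")
```

The two functions are mirror images: in the first case the sequence of interest creates the sensitivity, in the second it removes it.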

The nucleic acid sequence of interest, the nucleotide sequence of interest or a
coding region of interest, may be expressed in suitable eukaryotic hosts which are
transformed by the nucleotide sequences, or nucleic acid molecules, or genetic
constructs, or vectors of the present invention. Examples of suitable hosts include, but
are not limited to, insect hosts, mammalian hosts, yeasts and plants. Suitable plant
hosts include, but are not limited to agricultural crops including canola, Brassica spp.,
maize, tobacco, alfalfa, rice, soybean, wheat, barley, sunflower, and cotton.

The one or more chimeric genetic constructs of the present invention can
further comprise a 3' untranslated region. A 3' untranslated region refers to that
portion of a gene comprising a DNA segment that contains a polyadenylation signal
and any other regulatory signals capable of effecting mRNA processing or gene
expression. The polyadenylation signal is usually characterized by effecting the
addition of polyadenylic acid tracts to the 3' end of the mRNA precursor.
Polyadenylation signals are commonly recognized by the presence of homology to the
canonical form 5'-AATAAA-3' although variations are not uncommon. One or more
of the chimeric genetic constructs of the present invention can also include further
enhancers, either translation or transcription enhancers, as may be required. These
enhancer regions are well known to persons skilled in the art, and can include the
ATG initiation codon and adjacent sequences. The initiation codon must be in phase
with the reading frame of the coding sequence to ensure translation of the entire
sequence.
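
Recognition of a polyadenylation signal by homology to the canonical hexamer can be illustrated with a short scan. The sketch below is an assumption-laden illustration in Python, not a method taken from this specification: the function name and the choice of allowing a fixed number of mismatches as a stand-in for "variations" are hypothetical.

```python
CANONICAL_SIGNAL = "AATAAA"  # canonical form 5'-AATAAA-3'


def find_polya_signals(seq: str, max_mismatches: int = 1):
    """Return (position, hexamer, mismatches) for hexamers resembling AATAAA.

    Positions are 0-based offsets into the sense strand, read 5' to 3'.
    max_mismatches=0 reports only exact matches to the canonical form.
    """
    seq = seq.upper()
    width = len(CANONICAL_SIGNAL)
    hits = []
    for i in range(len(seq) - width + 1):
        hexamer = seq[i:i + width]
        mismatches = sum(a != b for a, b in zip(hexamer, CANONICAL_SIGNAL))
        if mismatches <= max_mismatches:
            hits.append((i, hexamer, mismatches))
    return hits


if __name__ == "__main__":
    # Hypothetical 3' region used only to exercise the function.
    print(find_polya_signals("GGGAATAAAGGG", max_mismatches=0))
```

A one-mismatch tolerance will also report many chance matches in A/T-rich DNA, so hits of this kind are candidates for inspection rather than confirmed signals.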

Examples of suitable 3' regions are the 3' transcribed non-translated regions
containing a polyadenylation signal of Agrobacterium tumor inducing (Ti) plasmid
genes, such as the nopaline synthase (Nos gene) and plant genes such as the soybean
storage protein genes and the small subunit of the ribulose-1,5-bisphosphate
carboxylase (ssRUBISCO) gene.

To aid in identification of transformed plant cells, the constructs of this
invention may be further manipulated to include selectable markers. Useful selectable
markers in plants include enzymes which provide for resistance to chemicals such as
an antibiotic, for example, gentamycin, hygromycin, kanamycin, or herbicides such as
phosphinothricin, glyphosate, chlorosulfuron, and the like. Similarly, enzymes
providing for production of a compound identifiable by colour change such as GUS
(β-glucuronidase), or luminescence, such as luciferase or GFP, are useful.

Also considered part of this invention are transgenic eukaryotes, for example
but not limited to plants containing the chimeric gene construct of the present
invention. However, it is to be understood that the chimeric gene constructs of the
present invention may also be combined with nucleic acid sequence of interest for
expression within a range of eukaryotic hosts.

In instances where the eukaryotic host is a plant, methods of regenerating
whole plants from plant cells are also known in the art. In general, transformed plant
cells are cultured in an appropriate medium, which may contain selective agents such
as antibiotics, where selectable markers are used to facilitate identification of
transformed plant cells. Once callus forms, shoot formation can be encouraged by
employing the appropriate plant hormones in accordance with known methods and the
shoots transferred to rooting medium for regeneration of plants. The plants may then
be used to establish repetitive generations, either from seeds or using vegetative
propagation techniques. Transgenic plants can also be generated without using tissue
cultures (for example, Clough and Bent, 1998).

The constructs of the present invention can be introduced into plant cells using
Ti plasmids, Ri plasmids, plant virus vectors, direct DNA transformation, micro-injection,
electroporation, etc. For reviews of such techniques see, for example,
Weissbach and Weissbach, 1988; Geierson and Corey, 1988; Miki and Iyer,
1997; and Clough and Bent, 1998. The present invention further includes a suitable
vector comprising the chimeric gene construct.

The DNA binding protein which is employed in the method of the present
invention may be naturally produced in an organism other than a plant. For example,
but not wishing to be considered limiting, a ROS repressor is encoded by a nucleotide
sequence of bacterial origin and, as such, the nucleotide sequence may be optimised,
for example, by changing its codons to favour plant codon usage, by attaching a
nucleotide sequence encoding a nuclear localisation signal (NLS), for example but not
limited to the SV40 localisation signal (see Robbins et al., 1991; Rizzo, R. et al., 1991;
which are incorporated herein by reference) in order to improve the efficiency of ROS
transport to the plant nucleus to facilitate the interaction with its respective operator,
or both. Addition of an NLS to a fusion protein
comprising a binding domain, for example the ROS repressor binding domain, and a
recruitment factor, may also ensure targeting of the fusion product to the nuclear
compartment. Similar optimization may be performed for other DNA binding proteins
of non-plant source; however, such optimization may not always be required. Other
possible nuclear localization signals that may be fused to a DNA binding protein
include but are not limited to those listed in Table 1:



Table 1: nuclear localization signals

Nuclear Protein          Organism+  NLS                                     Ref
AGAMOUS                  A          RienttnrqvtfcKRR (SEQ ID NO:13)         1
TGA-1A                   T          RRlaqnreaaRKsRIRKK (SEQ ID NO:14)       2
TGA-1B                   T          KKRaRlvrnresaqlsRqRKK (SEQ ID NO:15)    2
O2 NLS B                 M          RKRKesnresaRRsRyRK (SEQ ID NO:16)       3
NIa                      V          KKnqfchklkm-32aa-KRK (SEQ ID NO:17)     4
Nucleoplasmin            X          KRpaatkkagqaKKKKl (SEQ ID NO:18)        5
NO38                     X          KRiapdsaskvpRKKtR (SEQ ID NO:19)        5
N1/N2                    X          KRKteeesplKdKdaKK (SEQ ID NO:20)        5
Glucocorticoid receptor  M,R        RkclqagmnleaRKtKK (SEQ ID NO:21)        5
α receptor               H          RKclqagmnleaRKtKK (SEQ ID NO:22)        5
β receptor               H          RKclqagmnleaRKtKK (SEQ ID NO:23)        5
Progesterone receptor    C,H,Ra     RKccqagmvlggRKfKK (SEQ ID NO:24)        5
Androgen receptor        H          RKcyeagmtlgaRKlKK (SEQ ID NO:25)        5
p53                      C          RRcfevrvcacpgRdRK (SEQ ID NO:26)        5

+ A, Arabidopsis; X, Xenopus; M, mouse; R, rat; Ra, rabbit; H, human; C, chicken; T,
tobacco; M, maize; V, potyvirus.
References:
1. Yanovsky et al., 1990

Incorporation of a nuclear localization signal into the fusion protein of the
present invention may facilitate migration of the fusion protein into the nucleus.
Without wishing to be bound by theory, reduced levels of fusion proteins elsewhere
within the cell may be important when the DNA binding portion of the fusion protein
may bind analogue operator sequences within other organelles, for example within the
mitochondrion or chloroplast. Furthermore, the use of a nuclear localization signal
may permit the use of a less active promoter or regulatory region to drive the
expression of the fusion protein while ensuring that the concentration of the expressed
protein remains at a desired level within the nucleus, and that the concentration of the
protein is reduced elsewhere in the cell.
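
At the sequence level, attaching an NLS to a fusion protein amounts to joining two amino acid strings. The following is a minimal sketch in Python under stated assumptions: the helper names are hypothetical, the default NLS is the PKKKRKV sequence listed as SEQ ID NO:6, and the choice of N- or C-terminal placement (and of keeping an initiator methionine first) is an illustrative design decision, not something specified in the text.

```python
SV40_NLS = "PKKKRKV"  # aa seq of NLS listed as SEQ ID NO:6


def add_nls(protein: str, nls: str = SV40_NLS, position: str = "C") -> str:
    """Return the protein sequence with an NLS attached at the chosen terminus.

    position="C" appends the NLS; position="N" places it at the N-terminus,
    immediately after an initiator Met if the protein starts with one.
    """
    protein = protein.upper()
    if position == "C":
        return protein + nls
    if position == "N":
        if protein.startswith("M"):
            return "M" + nls + protein[1:]
        return nls + protein
    raise ValueError("position must be 'N' or 'C'")


def basic_fraction(seq: str) -> float:
    """Fraction of residues that are lysine (K) or arginine (R)."""
    seq = seq.upper()
    return sum(seq.count(residue) for residue in "KR") / len(seq)


if __name__ == "__main__":
    # "MAAA" is a placeholder sequence, not a sequence from this specification.
    print(add_nls("MAAA"))
    print(add_nls("MAAA", position="N"))
```

The `basic_fraction` helper simply reflects that the signals in Table 1 are rich in basic residues, which are the ones shown in upper case there.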

Referring now to Figures 2A-C, there are shown aspects of an embodiment of
the method of the present invention. Shown in Figure 2A are two constructs which
have been introduced within a plant cell. The constructs comprise:

1) a first nucleotide sequence (10) comprising,

a) a nucleic acid sequence of interest (20) operatively linked to a first
regulatory region (30);

b) an operator sequence (40) capable of binding a fusion protein (85,
Figure 2B), and;

2) a second nucleotide sequence (60) comprising a second regulatory region
(70) in operative association with a nucleotide sequence (80) encoding a
fusion protein (85).

The fusion protein (Figure 2B; 85) encoded by nucleotide sequence (80) comprises

a) a DNA binding protein (100), or a portion of a DNA binding protein
capable of binding the operator sequence (40, Figure 2A), and;

b) a recruitment factor protein (110), or a portion of a recruitment
factor protein capable of binding a chromatin remodelling protein
(120), for example but not limited to histone deacetylase, HDAC.

In the example shown in Figures 2A-C, the operator sequence (40) is shown as being
upstream from the regulatory region (30); however, the operator sequence may also be
positioned downstream from the regulatory region (30), for example between the
regulatory region (30) and the nucleic acid sequence of interest (20; see for example
the constructs in Figure 5A-D), within the coding region of the nucleic acid sequence
of interest (20), or downstream of the nucleic acid sequence of interest (20).

Referring now to Figure 2C, but without wishing to be bound by theory,
transcription and translation of nucleotide sequence (60; Figure 2A) produces fusion
protein (85; Figure 2B) which is capable of binding operator sequence (40; Figure
2A) and, for example, histone deacetylase (120). Dual binding of histone deacetylase
(120) to fusion protein (85) and fusion protein (85) to operator sequence (40)
facilitates enzymatic deacetylation of histones (via bound histone deacetylase) in
proximity of the nucleic acid sequence of interest (20) thereby causing repression of
the nucleic acid sequence of interest (20).

The first (10) and second (60) nucleotide sequences may be placed within the
same or within different vectors, genetic constructs, or nucleic acid molecules.
Preferably, the first nucleotide sequence and the second nucleotide sequence are
chromosomally integrated into a plant or plant cell. The two nucleotide sequences
may be integrated into two different genetic loci of a plant or plant cell, or the two
nucleotide sequences may be integrated into a single genetic locus of a plant or
plant cell. However, the second nucleotide sequence may be integrated into the DNA
of the plant or it may be present as an extra-chromosomal element, for example, but
not wishing to be limiting, a plasmid. Furthermore, the first and second regulatory
regions may be the same or different, and may be active in a constitutive, temporal,
developmental or inducible manner.

Referring now to Figures 3A-C, there are shown aspects of an alternate
embodiment of the method of the present invention. Shown in Figure 3A are two
constructs which have been introduced into a plant cell. The constructs comprise:

1) a first nucleotide sequence (10) comprising,

a) a nucleic acid sequence of interest (20) operatively linked to a
regulatory region (30),

b) an operator sequence (40) capable of binding a fusion protein (85,
Figure 3B), and;

2) a second nucleotide sequence (60) comprising a regulatory region (70) in
operative association with a nucleotide sequence (80) encoding a fusion
protein (85).

The fusion protein (85) encoded by nucleotide sequence (80) comprises,

a) a DNA binding protein (100), or a portion of a DNA binding protein
capable of binding the operator sequence (40), and;

b) a recruitment factor protein (110), or a portion of a recruitment
factor protein capable of binding a chromatin remodelling protein, for
example but not limited to, free histone acetyltransferase (HAT) (120).

In the example shown in Figures 3A-C, the operator sequence (40) is shown as being
upstream from the regulatory region (30); however, the operator sequence may also be
positioned downstream from the regulatory region (30), for example between the
regulatory region (30) and the nucleic acid sequence of interest (20; see for example
the constructs in Figure 5A-D), within the coding region of the nucleic acid sequence
of interest (20), or downstream of the nucleic acid sequence of interest (20).

Referring now to Figure 3C, but without wishing to be bound by theory,
transcription and translation of nucleotide sequence (80; Figure 3A) produces fusion
protein (85; Figure 3B) which is capable of binding operator sequence (40; Figure
3A) and free histone acetyltransferase (120). Dual binding of histone acetyltransferase
(120) to fusion protein (85) and fusion protein (85) to operator sequence (40)
facilitates enzymatic acetylation of histones (via bound histone acetyltransferase) in
proximity of the nucleic acid sequence of interest (20) thereby causing an increase in
the transcription of the nucleic acid sequence of interest (20).
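
The recruitment-based embodiments of Figures 2 and 3 differ only in which chromatin remodelling protein the fusion protein recruits. The following is a minimal sketch in Python of that decision rule, offered as an illustration only: the class and function names are hypothetical, and the model is qualitative (it returns a direction of change, not an expression level).

```python
from dataclasses import dataclass
from enum import Enum


class Remodeller(Enum):
    HDAC = "histone deacetylase"
    HAT = "histone acetyltransferase"


@dataclass(frozen=True)
class TargetConstruct:
    """First nucleotide sequence: operator plus nucleic acid sequence of interest."""
    operator: str
    sequence_of_interest: str


@dataclass(frozen=True)
class FusionProtein:
    """DNA binding portion plus recruitment factor portion."""
    binds_operator: str
    recruits: Remodeller


def predicted_effect(target: TargetConstruct, fusion: FusionProtein,
                     fusion_expressed: bool) -> str:
    """Return 'unchanged', 'repressed' or 'enhanced' for the sequence of interest."""
    if not fusion_expressed or fusion.binds_operator != target.operator:
        # No fusion protein at the operator, so no remodelling protein is recruited.
        return "unchanged"
    if fusion.recruits is Remodeller.HDAC:
        return "repressed"   # histone deacetylation near the sequence of interest
    return "enhanced"        # histone acetylation near the sequence of interest


if __name__ == "__main__":
    target = TargetConstruct(operator="ROS operator", sequence_of_interest="GUS")
    repressor = FusionProtein(binds_operator="ROS operator", recruits=Remodeller.HDAC)
    print(predicted_effect(target, repressor, fusion_expressed=True))
```

Because expression of the fusion protein is itself under a regulatory region, the `fusion_expressed` flag stands in for whatever constitutive, temporal, developmental or inducible control that region provides.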

The present invention also relates to a method of enhancing the expression of
a nucleic acid sequence of interest or enhancing the transcription of one or more
selected nucleotide sequences by transforming a plant with one or more constructs
comprising:

1) a first nucleotide sequence comprising,

a) a nucleic acid sequence of interest operatively linked to a regulatory
region, and;

b) an operator sequence that interacts with a fusion protein;

2) a second nucleotide sequence comprising a regulatory region in operative
association with a nucleotide sequence encoding a fusion protein comprising,

a) a DNA binding protein, or a portion of a DNA binding protein
capable of binding the operator sequence, and;

b) a histone acetyltransferase (HAT) protein, or portion of a histone
acetyltransferase protein which is capable of increasing histone
acetylation;

and wherein binding of the fusion protein to the operator sequence increases
histone acetylation in the proximity of the nucleic acid sequence of interest within the
first nucleotide sequence thereby increasing the transcription of the nucleic acid
sequence of interest.

These first and second nucleotide sequences may be placed within the same or
within different vectors, genetic constructs, or nucleic acid molecules. Preferably, the
first nucleotide sequence and the second nucleotide sequence are chromosomally
integrated into a plant or plant cell. The two nucleotide sequences may be integrated
into two different genetic loci of a plant or plant cell, or the two nucleotide sequences
may be integrated into a single genetic locus of a plant or plant cell. However, the
second nucleotide sequence may be integrated into the DNA of the plant or it may be
present as an extra-chromosomal element, for example, but not wishing to be limiting,
a plasmid, or transiently expressed, for example when using viral vectors or biolistics
for transformation.

Preferably, the operator sequence is located in a nucleotide region that does
not sterically hinder binding of transcription factors to the regulatory region, binding
of the RNA polymerase to the nucleic acid sequence of interest, or migration of the
polymerase along the DNA of the first nucleotide sequence, nucleic acid sequence of
interest or both.

Referring now to Figures 4A-C, there are shown aspects of an embodiment of
the method of the present invention. Shown in Figure 4A are two constructs which
have been introduced within a plant cell. The constructs comprise:

1) a first nucleotide sequence (10) comprising,

a) a nucleic acid sequence of interest (20) operatively linked to a
regulatory region (30),

b) an operator sequence (40) capable of binding a fusion protein (85),
and;

2) a second nucleotide sequence (60) comprising a regulatory region (70) in
operative association with a nucleotide sequence (80) encoding a fusion
protein (85).

The fusion protein (85) encoded by nucleotide sequence (80) comprises

a) a DNA binding protein (100), or a portion of a DNA binding protein
capable of binding the operator sequence (40), and;

b) a histone acetyltransferase protein (130), or a portion of a histone
acetyltransferase protein.

Referring now to Figure 4C, but without wishing to be bound by theory,
transcription and translation of nucleotide sequence (80; Figure 4A) produces fusion
protein (85; Figure 4B) which comprises an active HAT protein (130), or portion
thereof. Binding of the fusion protein (85) to the operator sequence facilitates
enzymatic acetylation of histones in proximity to the nucleic acid sequence of interest
(20) thereby enhancing the expression of a nucleic acid sequence of interest.

In the example shown in Figures 4A-C, the operator sequence (40) is shown as
being upstream from the regulatory region (30); however, the operator sequence may
also be positioned downstream from the regulatory region (30), for example between
the regulatory region (30) and the nucleic acid sequence of interest (20; see for
example the constructs in Figure 5A-D), within the coding region of the nucleic acid
sequence of interest (20), or downstream of the nucleic acid sequence of interest (20).

Also contemplated by the present invention is the control of gene expression
accomplished through combinations of activator, effector and gene of interest
constructs as outlined in Figures 29A and B (see Example 6). With reference to
Figure 29A, the expression of a gene of interest (reporter) is regulated using three
constructs:

a reporter construct (or gene of interest construct),
an activator construct and
an effector construct.

The gene of interest construct includes a gene of interest, for example but not limited
to a reporter gene (e.g. the lacZ gene), in operative association with a regulatory
element and an operator sequence.

The activator construct comprises a nucleic acid sequence encoding a
recruitment factor protein, or a portion thereof, capable of binding a chromatin
remodelling protein, fused with a nucleotide sequence encoding a DNA binding
protein, or a fragment thereof. The recruitment factor protein may be, for example
but not limited to BnSCL1, bnKCP1 or an active fragment thereof; the DNA binding
protein could be, for example but not limited to VP16 or GAL4 DNA binding
domain. In this case the activator construct produces a VP16-BnSCL1 fusion protein.

The effector plasmid includes a nucleic acid sequence encoding a chromatin
remodelling factor, for example but not limited to HDA19, operatively associated
with a regulatory element and a nucleic acid sequence encoding a nuclear localisation
signal. The constructs are expressed in eukaryotes, for example plant, animal or
yeast.

When the activator construct is co-expressed with the gene of interest
(reporter) construct, the DNA binding sequence binds the operator sequence of the
gene of interest construct. This results in modification in the expression of the gene
of interest due to interaction of the activator protein within the transcriptional
machinery. In this example, the activator protein is fused to a recruitment factor
protein, and the VP16-BnSCL1 fusion protein binds the Tet operator sequence of the
gene activator construct resulting in increased expression of the gene of interest.

Co-expression of the effector construct, in conjunction with the gene of interest
and activator constructs, results in synthesis of a chromatin remodelling factor, in this
case HDA19, which associates with the recruitment factor protein, BnSCL1.
Association of HDAC with the construct expressing the gene of interest reduces
expression of the gene of interest.

In a second aspect, the expression of a gene of interest is regulated using two
constructs: a gene of interest (reporter)+activator construct and an effector construct,
as shown in Figure 29B. Expression of the reporter+activator construct results in an
increased expression of the gene of interest due to binding of the activator portion of the
construct to the operator sequence of the gene of interest construct. This association
may be inhibited in the presence of tetracycline. As in the case outlined with
reference to Figure 29A, above, co-expression of the effector construct results in
reduced expression of the gene of interest due to association of HDAC to the
activator-recruitment factor fusion protein (VP16-BnSCL1 fusion).
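
The activator and effector logic outlined for Figures 29A and B can be written as a small qualitative rule. The sketch below is in Python and is illustrative only: the function name and the three output labels are hypothetical, and treating tetracycline as completely preventing the activator from associating with the operator is a simplifying assumption (the text says only that the association may be inhibited).

```python
def reporter_expression(activator: bool = False,
                        effector: bool = False,
                        tetracycline: bool = False) -> str:
    """Qualitative expression of the gene of interest (reporter).

    Returns one of:
      "basal"     - no activator fusion bound at the operator
      "increased" - activator-recruitment factor fusion bound, no effector
      "reduced"   - effector (e.g. HDA19) recruited to the bound fusion,
                    lowering expression relative to the activated state
    """
    activator_bound = activator and not tetracycline
    if not activator_bound:
        # Without the bound fusion there is nothing for the effector to associate with.
        return "basal"
    if effector:
        return "reduced"
    return "increased"


if __name__ == "__main__":
    for act in (False, True):
        for eff in (False, True):
            print(f"activator={act!s:5} effector={eff!s:5} -> {reporter_expression(act, eff)}")
```

The same rule covers both aspects, since in the two-construct arrangement the activator is simply carried on the same construct as the gene of interest.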

The present invention also provides for a method to regulate expression of a
nucleic acid sequence of interest, wherein the nucleic acid sequence of interest
comprises an endogenous sequence. In this embodiment, a nucleotide sequence
comprising a regulatory region in operative association with a nucleotide sequence
encoding a recruitment factor, or a portion thereof, that is known to interact with a
factor that binds the nucleic acid sequence of interest, is expressed in the host. The
recruitment factor protein, or a portion thereof, is capable of binding a chromatin
remodelling protein, for example but not limited to, HDAC or HAT, and the recruitment
factor also interacts with endogenous factors that bind the nucleotide sequence of
interest (e.g. transcription factors). In this manner, expression of the recruitment
factor in a temporal, tissue specific, or induced manner will result in the expression of
the recruitment factor that binds the chromatin remodelling factor and
transcription factor resulting in modulation of expression of the nucleic acid sequence
of interest. A non-limiting example of this embodiment includes the expression of
bnKCP1 and its interaction with HDAC and transcription factors ERF, SEBF or CBF.

Therefore, the present invention provides a method to regulate expression of
an endogenous nucleic acid sequence of interest in a plant comprising:

i) introducing into the plant a nucleotide sequence comprising, a regulatory region,
operatively linked with a nucleotide sequence encoding a recruitment factor protein,
the recruitment factor protein capable of binding an endogenous DNA binding
protein, the endogenous DNA binding protein characterized in binding a segment of a
DNA sequence of the endogenous nucleotide sequence of interest, and;

ii) growing the plant, wherein expression of the nucleotide sequence produces the
recruitment factor thereby regulating expression of the endogenous nucleic acid
sequence of interest.

An alternate embodiment of the present invention includes a method to
regulate expression of an endogenous nucleic acid sequence of interest. In this
example, a DNA binding protein, or a portion thereof, known to interact with the
DNA of an endogenous nucleic acid sequence of interest is fused to a
recruitment factor. Expression of the fusion protein permits the recruitment factor
portion of the fusion protein to interact or bind with a chromatin remodelling protein, for
example but not limited to HDAC or HAT, and the DNA binding portion of the fusion
protein binds the nucleotide sequence of interest. In this manner, expression of the
fusion protein in a temporal, tissue specific, or induced manner will result in the
expression of a recruitment factor that binds a chromatin remodelling factor and the
DNA of a nucleic acid sequence of interest, resulting in modulation of expression of
the endogenous nucleic acid sequence of interest. Examples of DNA binding
proteins, or portions thereof, that bind endogenous nucleic acid sequences of interest,
which are not to be considered limiting, include ERF, SEBF or CBF. A non-limiting
example of a recruitment factor is bnKCP1 or BnSCL1.

Therefore, the present invention also provides a method to regulate expression
of an endogenous nucleic acid sequence of interest in a plant comprising:

i) introducing into the plant a nucleotide sequence comprising, a regulatory region,
operatively linked with a nucleotide sequence encoding a fusion protein, the fusion
protein comprising,

a) a DNA binding protein, or a portion thereof, capable of binding a segment
of a DNA sequence of the endogenous nucleotide sequence of interest, and;

b) a recruitment factor protein, or a portion thereof, capable of binding a
chromatin remodelling protein; and

ii) growing the plant, wherein expression of the nucleotide sequence produces the
fusion protein that regulates expression of the endogenous nucleic acid sequence of
interest.



WO 03/104462 PCT/CA03/00822 

Also contemplated by the present invention is a method of increasing cold
tolerance in a plant. The method comprises providing a plant having a nucleotide
sequence of interest operatively linked to a first regulatory region; the nucleotide
sequence of interest encodes bnKCP1, or a fragment thereof. The plant is maintained
under conditions where bnKCP1 is expressed. In this manner, the plant expressing
bnKCP1 is preconditioned for cold adaptation and exhibits increased cold tolerance.

By the term cold in the context of cold tolerance, it is meant a temperature in
the range of about -10°C to about 10°C. An example of cold temperature, without
wishing to be limiting, is a temperature in the range of about -8°C to about 8°C; a
further example is a temperature of about -1 0 to about - 1 °C. 

Sequences of the present invention are listed in Table 2. 

15 Table 2 



SE0IDNO:l 


aa seq of wild-type ROS (A. tumefaciens) 


Fig 1 A CWT-ROS 


SEQIDNO:2 


Nucl seq synthetic ROS optimized for plant, with NLS 


Fig IB 


SEQIDNO:3 


Consensus nucl seq of composite ROS 


FiglC 


SE0IDNO:4 


aa seq of synthetic ROS 


Fig 1A, 1C 


SEQ ID NO:5 


ROS binding sequence 


Fig IE 


SE0EDNO:6 


aa seq of NLS (PKKKRKV) 




SEQE>NO:7 


ROS operator sequence 




SEQIDNO:8 


EPT gene operator sequence 




SE0IDNO:9 


Operator sequence binding to ERF 




SEQ ID NO: 10 


Operator sequence binding to SEBF 




SEQ ED NO: 11 


Operator sequence binding to CBF 




SEQ ID NO: 12 


Operator sequence binding to CBF 




SEQ ID NO: 13 


NLS of AGAMOUS protein 


Table 1, page 30 


SEQIDNO:14 


NLS of TGA-1 A protein 


Table 1, page 30 


SEQ ID NO: 15 


NLS of TGA-1B protein 


Table l,jgage30 


SEQIDNO:16 


NLS of 02 NLS B protein 


Table 1, page 30 


SEQIDNO:17 


"NTT .S of NTa protein 


Table Ljpage30 


SEQ ID NO: 18 


NLS of nucleoplasmin protein 


Table 1, page 30 


SEQIDNO:19 


NLS of N038 protein 


Table 1, page 30 


SEQ ID NO:20 


NLS of N1/N2 protein 


Table Ljpage 30 


SEQ ID NO:21 


NLS of Glucocorticoid receptor 


Table 1, page 30 


SEQ ID NO:22 


NLS of Glucocorticoid a receptor 


Table 1, page 30 


SEQ ID NO:23 


NLS of Glucocorticoid b receptor 


Table 1, page 3C 


SEQ ID NO:24 


NLS of Progesterone receptor 


Table 1, page 3C 


SEQIDNO:25 


NLS of Androgen receptor 


Table 1, page 3C 
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SEQ ID NO:26 


NLS of p53 protein 


Table 1, page 30 


SEQ ID NO:27    VirC/VirD operator seq    Fig 1D
SEQ ID NO:28    ROS-OPDS, p74-315
SEQ ID NO:29    ROS-OPDA, p74-315
SEQ ID NO:30    ROS-OPUS, p74-316
SEQ ID NO:31    ROS-OPUA, p74-316
SEQ ID NO:32    ROS-OPPS, p74-309
SEQ ID NO:33    ROS-OPPA, p74-309
SEQ ID NO:34    ROS-OP1, p74-508
SEQ ID NO:35    ROS-OP2, p74-508
SEQ ID NO:36    tms2 promoter sense primer, p74-508
SEQ ID NO:37    tms2 promoter anti-sense primer, p74-508
SEQ ID NO:38    Actin2 promoter sense primer, p74-501
SEQ ID NO:39    Actin2 promoter anti-sense primer, p74-501
SEQ ID NO:40    p74-315 seq from EcoRV to ATG of GUS
SEQ ID NO:41    p74-316 seq from EcoRV to ATG of GUS
SEQ ID NO:42    p74-309 seq from EcoRV to ATG of GUS
SEQ ID NO:43    p74-118 seq from EcoRV to ATG of GUS
SEQ ID NO:44    Forward primer for HDA19 A. thaliana, pDBLeu-HDA19
SEQ ID NO:45    Reverse primer for HDA19 A. thaliana, pDBLeu-HDA19
SEQ ID NO:46    Forward primer for Gcn5 Arabidopsis, GST-Gcn5
SEQ ID NO:47    Reverse primer for Gcn5 Arabidopsis, GST-Gcn5
SEQ ID NO:48    Reverse primer for HDA19, GST-HDA19
SEQ ID NO:49    Forward primer for bnKCP1, 1-80, 1-160 (generation of mutants)
SEQ ID NO:50    Reverse primer for bnKCP1 1-160 (generation of mutants)
SEQ ID NO:51    Reverse primer for bnKCP1 1-80 (generation of mutants)
SEQ ID NO:52    Reverse primer for bnKCP1 (generation of mutants)
SEQ ID NO:53    Forward primer for bnKCP1, 1-80 and 1-160 (in vivo assay and transactivation assay)
SEQ ID NO:54    Reverse primer for bnKCP1 (in vivo assay and transactivation assay) and 81-215 (transactivation assay)
SEQ ID NO:55    Reverse primer for bnKCP1 1-160 (in vivo assay and transactivation assay)
SEQ ID NO:56    Reverse primer for bnKCP1 1-80 (in vivo assay and transactivation assay)
SEQ ID NO:57    Forward primer for bnKCP1G188
SEQ ID NO:58    Reverse primer for bnKCP1G188
SEQ ID NO:59    Forward primer for bnKCP1 81-215 (transactivation assay)
SEQ ID NO:60    Forward primer for entire coding region of bnKCP1
SEQ ID NO:61    Reverse primer for entire coding region of bnKCP1
SEQ ID NO:62    pat7 NLS (PLNKKRR)
SEQ ID NO:63    aa seq of ROSR (ROS repressor)    Fig 1A
SEQ ID NO:64    aa seq of ROSAR (ROS repressor)    Fig 1A
SEQ ID NO:65    aa seq of MucR (ROS repressor)    Fig 1A
SEQ ID NO:66    VirC/VirD DNA binding site seq (1)    Fig 1D
SEQ ID NO:67    VirC/VirD DNA binding site seq (2)    Fig 1D
SEQ ID NO:68    ipt DNA binding site seq (1)    Fig 1D
SEQ ID NO:69    ipt DNA binding site seq (2)    Fig 1D
SEQ ID NO:70    Consensus DNA binding site seq    Fig 1D


SEQ ID NO:71    bnKCP aa seq    Fig 10A
SEQ ID NO:72    atKCP aa seq    Fig 10A
SEQ ID NO:73    atKCL1 aa seq    Fig 10A
SEQ ID NO:74    atKCL2 aa seq    Fig 10A
SEQ ID NO:75    bnKCP aa seq    Fig 10B
SEQ ID NO:76    ATF-1 aa seq    Fig 10B
SEQ ID NO:77    hyCREB aa seq    Fig 10B
SEQ ID NO:78    CREB aa seq    Fig 10B
SEQ ID NO:79    CREM aa seq    Fig 10B
SEQ ID NO:80    cCREM aa seq    Fig 10B
SEQ ID NO:81    aa seq of BnSCL1    Fig 20
SEQ ID NO:82    aa seq of atSCL15    Fig 20
SEQ ID NO:83    aa seq of IsSCR    Fig 20
SEQ ID NO:84    BnSCL1 sense primer
SEQ ID NO:85    BnSCL1 anti-sense primer
SEQ ID NO:86    BnIAA1 sense primer
SEQ ID NO:87    BnIAA1 anti-sense primer
SEQ ID NO:88    BnIAA12 sense primer
SEQ ID NO:89    BnIAA12 anti-sense primer




SEQ ID NO:90    Forward primer for BnSCL1, BnSCL1 1 - 33 *, BnSCL1' ' 01 , BnSCL1™ 17 and BnSCL1 1 ' 145 for pET-28b vector
SEQ ID NO:91    Reverse primer for BnSCL1 for pET-28b vector
SEQ ID NO:92    Reverse primer for BnSCL1 1 -"* for pET-28b vector
SEQ ID NO:93    Reverse primer for BnSCL1 1201 for pET-28b vector
SEQ ID NO:94    Reverse primer for BnSCL1 1 -' 1 ' for pET-28b vector
SEQ ID NO:95    Reverse primer for BnSCL1 1 ' 143 for pET-28b vector
SEQ ID NO:96    Forward primer for BnSCL1, BnSCL1 133 *, BnSCL1 1 " 01 , BnSCL1' 217 and BnSCL1 1 - 145 for pPC86 vector
SEQ ID NO:97    Forward primer for BnSCL1 140 - 33 * for pPC86 vector
SEQ ID NO:98    Forward primer for BnSCL1' 1 *- 434 for pPC86 vector
SEQ ID NO:99    Reverse primer for BnSCL1 and BnSCL1' 1 *- 434 for pPC86 vector
SEQ ID NO:100    Reverse primer for BnSCL1 1 - 33 * for pPC86 vector
SEQ ID NO:101    Reverse primer for BnSCL1 1 - 101 for pPC86 vector
SEQ ID NO:102    Reverse primer for BnSCL1 1 '' 1 ' for pPC86 vector
SEQ ID NO:103    Reverse primer for BnSCL1 1 ' 143 for pPC86 vector
SEQ ID NO:104    aa seq of LXXLL motif ( ,48 LGSLL 1 ^)





The above description is not intended to limit the claimed invention in any
manner. Furthermore, the discussed combination of features might not be absolutely
necessary for the inventive solution.

The present invention will be further illustrated in the following examples.
However, it is to be understood that these examples are for illustrative purposes only,
and should not be used to limit the scope of the present invention in any manner.

Examples

Materials and Methods 

Plant Material 



Wild type Arabidopsis thaliana, ecotype Columbia, seeds were germinated on
RediEarth (W.R. Grace & Co.) soil in pots covered with window screens under
greenhouse conditions (~25°C, 16 hr light). Emerging bolts were cut back to encourage
further bolting. Plants were used for transformation once multiple secondary bolts had
been generated.

Plant Transformation

Plant transformation was carried out according to the floral dip procedure
described in Clough and Bent (1998). Essentially, Agrobacterium tumefaciens
transformed with the construct of interest (using standard methods as known in the
art) was grown overnight in 100 ml of Luria-Bertani broth (10 g/L NaCl, 10 g/L
tryptone, 5 g/L yeast extract) containing 50 µg/ml kanamycin. The cell suspension
culture was centrifuged at 3000 x g for 15 min. The pellet was resuspended in 1 L of
the transformation buffer (sucrose (5%), Silwet L77 (0.05%) (Loveland Industries)).
The above-ground parts of the Arabidopsis plants were dipped into the Agrobacterium
suspension for ~1 min and the plants were then transferred to the greenhouse. The
entire transformation process was repeated twice more at two day intervals. Plants
were grown to maturity and seeds collected. To select for transformants, seeds were
surface sterilized by washing in 0.05% Tween 20 for 5 min, with 95% ethanol for
5 min, and then with a solution containing sodium hypochlorite (1.575%) and Tween
20 (0.05%) for 10 min, followed by 5 washings in sterile water. Sterile seeds were
plated onto either Pete Lite medium (20-20-20 Peter's Professional Pete Lite fertilizer
(Scott) (0.762 g/L), agar (0.7%), kanamycin (50 µg/ml), pH 5.5) or MS medium (MS
salts (0.5X) (Sigma), B5 vitamins (1X), agar (0.7%), kanamycin (50 µg/ml), pH 5.7).
Plates were incubated at 20°C, 16 hr light/8 hr dark in a growth room. After
approximately two weeks, seedlings possessing green primary leaves were transferred
to soil for further screening and analysis.

Example 1: Optimization of ROS protein coding region. 

The ros nucleotide sequence is derived from Agrobacterium tumefaciens (SEQ
ID NO:1; Figure 1A). Analysis of the protein coding region of the ros nucleotide
sequence indicates that the codon usage may be altered to better conform to plant
translational machinery. The protein coding region of the ros nucleotide sequence
was therefore modified to optimize expression in plants (SEQ ID NO:2; Figure 1B).
The nucleic acid sequence of the ROS repressor was examined and the coding region
modified to optimize for expression of the gene in plants, using a procedure similar to
that outlined by Sardana et al. (1996). A table of codon usage from highly expressed
genes of dicotyledonous plants was compiled using the data of Murray et al. (1989).
The ros nucleotide sequence was also modified (SEQ ID NO:2; Figure 1B) to ensure
localization of the ROS repressor to the nucleus of plant cells, by adding an SV40
nuclear localization signal (Rizzo, P. et al., 1999; the nuclear localization signal
resides at amino acid positions 126-132; accession number AAF28270).

The ros gene is cloned from Agrobacterium tumefaciens by PCR. The
nucleotide sequence encoding the ROS protein is expressed in, and purified from, E.
coli, and the ROS protein used to generate an anti-ROS antiserum in rabbits using
standard methods (Sambrook et al.).
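
The codon replacement step described in this example can be sketched in Python as follows. This is only a minimal illustration under stated assumptions: the preferred-codon table and the short input sequence are hypothetical placeholders, not the dicot codon usage table compiled from Murray et al. (1989) and not the ros coding region, and the function names are chosen here for illustration.

```python
from itertools import product

# Standard genetic code, codons enumerated with bases in T, C, A, G order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}

# HYPOTHETICAL preferred codons, for illustration only. A real table would be
# derived from codon usage data for highly expressed plant genes.
PREFERRED = {"A": "GCT", "R": "AGA", "L": "CTT", "S": "TCT",
             "T": "ACT", "E": "GAG", "K": "AAG"}


def translate(cds):
    """Translate an in-frame coding sequence into one-letter amino acids."""
    if len(cds) % 3:
        raise ValueError("coding sequence length must be a multiple of 3")
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3))


def recode(cds, preferred):
    """Swap each codon for the preferred synonymous codon, if one is listed."""
    protein = translate(cds)  # also validates the length
    codons = [cds[i:i + 3] for i in range(0, len(cds), 3)]
    recoded = "".join(preferred.get(CODON_TABLE[c], c) for c in codons)
    # The encoded protein must be unchanged by the recoding.
    assert translate(recoded) == protein
    return recoded


# Hypothetical 8-codon sequence (not taken from the ros gene).
EXAMPLE_CDS = "ATGACGGAAAAATCGCTGCGCTAA"
EXAMPLE_RECODED = recode(EXAMPLE_CDS, PREFERRED)
```

The sketch changes only synonymous codons, so the amino acid sequence of the encoded protein is preserved while the nucleotide sequence moves toward the assumed plant preference.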

Example 2: Constructs placing a nucleic acid sequence of interest under
transcriptional control of regulatory regions that have been modified to contain
ROS operator sites, and preparation of reporter lines.



p74-315: Construct for The Expression of GUS Gene Driven by a CaMV 35S 
Promoter Containing a ROS Operator Downstream of TATA Box (Figure 
5(A)). 



WO 03/104462 PCT/CA03/00822 

The BamHI-EcoRV fragment of the CaMV 35S promoter in pBI121 is cut out and
replaced with a similar synthesized DNA fragment in which the 25 bp immediately
downstream of the TATA box are replaced with the ROS operator sequence:

TATATTTCAATTTTATTGTAATATA (SEQ ID NO: 7).

Two complementary oligos, ROS-OPDS (SEQ ID NO:28) and ROS-OPDA (SEQ ID
NO:29), with built-in BamHI-EcoRV ends, and spanning the BamHI-EcoRV region of
CaMV35S, in which the 25 bp immediately downstream of the TATA box are
replaced with the ROS operator sequence (SEQ ID NO: 7), are annealed together and
then ligated into the BamHI-EcoRV sites of CaMV35S.

ROS-OPDS: 5'-ATC TCC ACT GAC GTA AGG GAT GAC GCA CAA TCC CAC 
TAT CCT TCG CAA GAC CCT TCC TCT ATA TAA TAT ATT 
TCA ATT TTA TTG TAA TAT AAC ACG GGG GAC TCT AGA G- 
3' (SEQ ID NO:28) 

ROS-OPDA: 5'- G ATC CTC TAG AGT CCC CCG TGT TAT ATT ACA ATA 
AAA TTG AAA TAT ATT ATA TAG AGG AAG GGT CTT GCG 
AAG GAT AGT GGG ATT GTG CGT CAT CCC TTA CGT CAG 
TGG AGA T-3' (SEQ ID NO:29) 
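
The complementarity of the two oligos listed above can be checked with a short Python sketch. The sequences are transcribed from ROS-OPDS and ROS-OPDA as printed; the helper names are assumptions made for this illustration. Under these assumptions the anti-sense oligo equals the reverse complement of the sense oligo preceded by a 4-nucleotide 5' extension, GATC, which is the overhang left by BamHI digestion, while the other end of the duplex is blunt, as expected for an EcoRV end.

```python
COMPLEMENT = str.maketrans("ACGT", "TGCA")

ROS_OPERATOR = "TATATTTCAATTTTATTGTAATATA"  # SEQ ID NO:7

ROS_OPDS = (  # SEQ ID NO:28, sense oligo
    "ATC TCC ACT GAC GTA AGG GAT GAC GCA CAA TCC CAC "
    "TAT CCT TCG CAA GAC CCT TCC TCT ATA TAA TAT ATT "
    "TCA ATT TTA TTG TAA TAT AAC ACG GGG GAC TCT AGA G"
).replace(" ", "")

ROS_OPDA = (  # SEQ ID NO:29, anti-sense oligo
    "G ATC CTC TAG AGT CCC CCG TGT TAT ATT ACA ATA "
    "AAA TTG AAA TAT ATT ATA TAG AGG AAG GGT CTT GCG "
    "AAG GAT AGT GGG ATT GTG CGT CAT CCC TTA CGT CAG "
    "TGG AGA T"
).replace(" ", "")


def reverse_complement(seq):
    """Return the reverse complement of an upper-case DNA sequence."""
    return seq.translate(COMPLEMENT)[::-1]


def five_prime_overhang(top, bottom):
    """Return the unpaired 5' extension of `bottom` when it anneals to `top`.

    Returns None if `bottom` does not end with the reverse complement of `top`.
    """
    paired = reverse_complement(top)
    if not bottom.endswith(paired):
        return None
    return bottom[: len(bottom) - len(paired)]


OVERHANG = five_prime_overhang(ROS_OPDS, ROS_OPDA)
```

The same check can be applied to the other oligo pairs in this example by substituting their sequences.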

The p74-315 sequence from the EcoRV site (GAT ATC) to the first codon (ATG) of
GUS is shown below (TATA box - lower case in bold; the synthetic ROS sequence -
bold caps; a transcription start site - ACA, bold italics; BamHI site - GGA TCC; and
the first codon of GUS, ATG, in italics; are also indicated):

5'-GAT ATC TCC ACT GAC GTA AGG GAT GAC GCA CAA TCC CAC TAT
CCT TCG CAA GAC CCT TCC TCt ata taA TAT ATT TCA ATT TTA TTG
TAA TAT AAC ACG GGG GAC TCT AGA GGA TCC CCG GGT GGT CAG TCC
CTT ATG-3'
(SEQ ID NO:40)

p74-316: Construct for The Expression of GUS Driven by a CaMV 35S Promoter
Containing a ROS Operator Upstream of TATA Box (Figure 5(B)).

The BamHI-EcoRV fragment of the CaMV 35S promoter in pBI121 is cut out
and replaced with a similar synthesized DNA fragment in which the 25 bp
immediately upstream of the TATA box are replaced with the ROS operator sequence
(SEQ ID NO: 7). Two complementary oligos, ROS-OPUS (SEQ ID NO:30) and
ROS-OPUA (SEQ ID NO:31), with built-in BamHI-EcoRV ends, and spanning the
BamHI-EcoRV region of CaMV35S, in which the 25 bp immediately upstream of the
TATA box are replaced with a ROS operator sequence (SEQ ID NO: 7), are
annealed together and then ligated into the BamHI-EcoRV sites of CaMV35S.

ROS-OPUS: 5'-ATC TCC ACT GAC GTA AGG GAT GAC GCA CAA TCT ATA
TTT CAA TTT TAT TGT AAT ATA CTA TAT AAG GAA GTT
CAT TTC ATT TGG AGA GAA CAC GGG GGA CTC TAG AG-3'
(SEQ ID NO:30)

ROS-OPUA: 5'-G ATC CTC TAG AGT CCC CCG TGT TCT CTC CAA ATG
AAA TGA ACT TCC TTA TAT AGT ATA TTA CAA TAA AAT
TGA AAT ATA GAT TGT GCG TCA TCC CTT ACG TCA GTG
GAG AT-3' (SEQ ID NO:31)

The p74-316 sequence from the EcoRV site (GAT ATC) to the first codon (ATG) of
GUS is shown below (TATA box - lower case in bold; the synthetic ROS sequence -
bold caps; a transcription start site - ACA, bold italics; BamHI site - GGA TCC; the
first codon of GUS, ATG - italics, are also indicated):

5'-GAT ATC TCC ACT GAC GTA AGG GAT GAC GCA CAA TCT ATA TTT
CAA TTT TAT TGT AAT ATA Cta tat aAG GAA GTT CAT TTC ATT TGG
AGA GAA CAC GGG GGA CTC TAG AGG ATC CCC GGG TGG TCA GTC CCT
TAT G-3' (SEQ ID NO:41)
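
The arrangement of features in the p74-316 sequence can be checked with a short Python sketch. The sequence is transcribed from SEQ ID NO:41 as printed above, with one illegible codon read as GAA by comparison with ROS-OPUS (SEQ ID NO:30). The feature names and the helper function are assumptions made for this illustration only.

```python
P74_316 = (  # SEQ ID NO:41, EcoRV site to the first codon of GUS
    "GAT ATC TCC ACT GAC GTA AGG GAT GAC GCA CAA TCT ATA TTT "
    "CAA TTT TAT TGT AAT ATA CTA TAT AAG GAA GTT CAT TTC ATT TGG "
    "AGA GAA CAC GGG GGA CTC TAG AGG ATC CCC GGG TGG TCA GTC CCT "
    "TAT G"
).replace(" ", "")

# Motifs named in the text; the first occurrence of each is reported.
MOTIFS = {
    "EcoRV site": "GATATC",
    "ROS operator": "TATATTTCAATTTTATTGTAATATA",  # SEQ ID NO:7
    "TATA box": "TATATA",
    "BamHI site": "GGATCC",
}


def locate_features(seq, motifs):
    """Map each feature name to the 0-based start of its first occurrence."""
    positions = {}
    for name, motif in motifs.items():
        start = seq.find(motif)
        if start < 0:
            raise ValueError(f"{name} not found")
        positions[name] = start
    return positions


FEATURES = locate_features(P74_316, MOTIFS)
# The GUS start codon is taken as the final three bases of the sequence,
# because ATG also occurs earlier within the promoter fragment.
GUS_START = len(P74_316) - 3
```

In this construct the operator is expected to lie upstream of the TATA box, followed by the BamHI site and the GUS start codon, and the computed positions can be compared against that order.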
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p74-309: Construct for The Expression of GUS Driven by a CaMV 35S Promoter 
Containing ROS Operators Upstream and Downstream of TATA Box (Figure 
5(C)). 

The BamHI-EcoRV fragment of the CaMV 35S promoter in pBI121 is cut out
and replaced with a similar synthesized DNA fragment in which the 25 bp
immediately upstream and downstream of the TATA box are replaced with two
ROS operator sequences (SEQ ID NO: 7). Two complementary oligos, ROS-OPPS
(SEQ ID NO:32) and ROS-OPPA (SEQ ID NO:33), with built-in BamHI-EcoRV
ends, and spanning the BamHI-EcoRV region of CaMV35S, in which the 25 bp
immediately upstream and downstream of the TATA box are replaced with two ROS
operator sequences, each comprising the sequence of SEQ ID NO: 7 (in italics,
below), are annealed together and ligated into the BamHI-EcoRV sites of CaMV35S.

ROS-OPPS: 5'-ATC TCC ACT GAC GTA AGG GAT GAC GCA CAA TCT ATA TTT
CAA TTT TAT TGT AAT ATA CTA TAT AAT ATA TTT CAA TTT TAT
TGT AAT ATA ACA CGG GGG ACT CTA GAG-3' (SEQ ID NO:32)

ROS-OPPA: 5'-G ATC CTC TAG AGT CCC CCG TGT TAT ATT ACA ATA AAA
TTG AAA TAT ATT ATA TAG TAT ATT ACA ATA AAA TTG AAA TAT
AGA TTG TGC GTC ATC CCT TAC GTC AGT GGA GAT-3' (SEQ ID
NO:33)

The p74-309 sequence from the EcoRV site (GAT ATC) to the first codon (ATG) of
GUS is shown below (TATA box - lower case in bold; two synthetic ROS sequences -
bold caps; a transcription start site - ACA, bold italics; BamHI site - GGA TCC; the
first codon of GUS, ATG - italics, are also indicated):

5'-GAT ATC TCC ACT GAC GTA AGG GAT GAC GCA CAA TCT ATA TTT
CAA TTT TAT TGT AAT ATA Cta tat aAT ATA TTT CAA TTT TAT TGT
AAT ATA ACA CGG GGG ACT CTA GAG GAT CCC CGG GTG GTC AGT CCC
TTA TG-3' (SEQ ID NO:42)




p74-118 Construct for The Expression of GUS Driven by a CaMV 35S 
Promoter Containing three ROS Operators Downstream of TATA Box (Figure 
5(D)). 

The BamHI-EcoRV fragment of the CaMV 35S promoter in pBI121 is cut out
and replaced with a similar synthesized DNA fragment in which a region downstream
of the TATA box is replaced with three ROS operator sequences (SEQ ID NO:43).
The first of the three synthetic ROS operator sequences is positioned immediately
downstream of the TATA box; the other two ROS operator sequences are located
downstream of the transcriptional start site (ACA). Two complementary oligos with
built-in BamHI-EcoRV ends were prepared as described above for the other
constructs, annealed together and ligated into the BamHI-EcoRV sites of CaMV35S.

The p74-118 sequence from the EcoRV site (GAT ATC) to the first codon
(ATG) of GUS is shown below (TATA box - lower case in bold; three synthetic ROS
sequences - bold caps; a transcription start site - ACA, bold italics; BamHI site - GGA
TCC; the first codon of GUS, ATG - italics, are also indicated):

5'-GAT ATC TCC ACT GAC GTA AGG GAT GAC GCA CAA TCC CAC TAT
CCT TCG CAA GAC CCT TCC TCt ata taA TAT ATT TCA ATT TTA TTG
TAA TAT AAC ACG GGG GAC TCT AGA GGA TCC TAT ATT TCA ATT TTA
TTG TAA TAT AGC TAT ATT TCA ATT TTA TTG TAA TAT AAT CGA TTT
CGA ACC CGG GGT ACC GAA TTC CTC GAG TCT AGA GGA TCC CCG GGT
GGT CAG TCC CTT ATG-3' (SEQ ID NO:43)

p76-508: Construct for The Expression of The GUS Gene Driven by the tms2 
Promoter Containing a ROS Operator (Figure 6(B)). 

The tms2 promoter is PCR amplified from genomic DNA of Agrobacterium
tumefaciens 33970 using the following primers:

sense primer: 5'-TGC GGA TGC ATA AGC TTG CTG ACA TTG CTA GAA
AAG-3' (SEQ ID NO:36)

anti-sense primer: 5'-CGG GGA TCC TTT CAG GGC CAT TTC AG-3' (SEQ
ID NO:37)

The 352 bp PCR fragment is cloned into the EcoRV site of pBluescript, and
sub-cloned into pGEM-7Zf(+). Two complementary oligos, ROS-OP1 (SEQ ID NO:34)
and ROS-OP2 (SEQ ID NO:35), containing two ROS operators (in italics, below), are
annealed together and cloned into pGEM-7Zf(+) as a BamHI/ClaI fragment at the 3'
end of the tms2 promoter. This promoter/operator fragment is then sub-cloned into
pBI121 as a HindIII/XbaI fragment, replacing the CaMV 35S promoter fragment.

ROS-OP1: 5'-GAT CCT ATA TTT CAA TTT TAT TGT AAT ATA GCT ATA TTT
CAA TTT TAT TGT AAT ATA AT-3' (SEQ ID NO:34)

ROS-OP2: 5'-CGA TTA TAT TAC AAT AAA ATT GAA ATA TAG CTA TAT TAC
AAT AAA ATT GAA ATA TAG-3' (SEQ ID NO:35).

As a control, p76-507 comprising a tms2 promoter (without any operator
sequence) fused to GUS (Figure 4(A)), is also prepared.

p74-501: Construct for The Expression of The GUS Gene Driven by The Actin2 
Promoter Containing a ROS operator (Figure 7(B)).

The Actin2 promoter is PCR amplified from genomic DNA of Arabidopsis
thaliana ecotype Columbia using the following primers:

Sense primer: 5'-AAG CTT ATG TAT GCA AGA GTC AGC-3' (SEQ ID NO:38)

SpeI

Anti-sense primer: 5'-TTG ACT AGT ATC AGC CTC AGC CAT-3' (SEQ ID
NO:39)

The PCR fragment is cloned into pGEM-T-Easy. Two complementary oligos,
ROS-OP1 (SEQ ID NO:34) and ROS-OP2 (SEQ ID NO:35), with built-in BamHI and ClaI
sites, and containing two ROS operators, are annealed together and inserted into the
Actin2 promoter at the BglII/ClaI sites, replacing the BglII/ClaI fragment. This
modified promoter is inserted into pBI121 vector as a HindIII/BamHI fragment.

As a control, p75-101, comprising an actin2 promoter (without any operator
sequence) fused to GUS (Figure 7(A)), is also prepared.

The various constructs are introduced into Arabidopsis, as described above,
and transgenic plants are generated. Transformed plants are verified using PCR or
Southern analysis. Figure 8(A) shows Southern analysis of transgenic plants
comprising a first genetic construct, for example, p74-309 (35S-ROS operator
sequence-GUS, Figure 5(C)).

Example 3: Crossing of transgenic lines containing fusion constructs with 
transgenic lines containing GUS reporter constructs. 



Transgenic Arabidopsis lines containing fusion constructs (second genetic
constructs) are crossed with lines containing appropriate reporter (GUS) constructs
(first genetic constructs). To perform the crossing, open flowers are removed from
plants of the reporter lines. Fully formed buds of plants of the repressor lines are
gently opened and emasculated by removing all stamens. The stigmas are then
pollinated with pollen from plants of the repressor lines and pollinated buds are
tagged and bagged. Once siliques have formed, the bags are removed, and mature
seeds are collected. Plants generated from these seeds are then used to determine the
level of reporter gene (GUS) repression by GUS staining. Levels of GUS expression in
the hybrid lines are compared to those of the original reporter lines. Plants showing
modified GUS expression levels are further characterized using PCR, Southern and
Northern analysis.

Example 4: Preparation of a Chromatin Remodelling Factor 

HDAC was used as an example of a chromatin remodelling factor that may be
isolated from an organism. Transcription factors that recruit histone deacetylase
(HDAC) to target promoters in Brassica napus were identified in vivo by screening a
yeast two-hybrid library using the Arabidopsis thaliana HDA19 as bait. A cDNA
clone that encodes a novel protein, bnKCP1, containing a kinase-inducible domain
(KID) was identified. Southern blot analysis indicated that the bnKCP1 gene belongs
to a small gene family of at least three members, and northern blot analysis showed
that it was strongly expressed in stems, flowers, roots and immature siliques seeds,
but not in leaf blades. In vitro protein binding assays showed that the protein is able to
interact with both HDA19 and histone acetyltransferase (HAT) and that the KID
domain is required for this interaction with HDA19 and HAT in vitro. When assayed
in vivo, bnKCP1 exerted modest activation of transcription of a reporter gene in yeast.

The cAMP-responsive element (CRE) binding protein (CREB) binds to the
CREB-binding protein (CBP) in response to extracellular stimuli that induce
intracellular accumulation of secondary messengers Ca2+ and cAMP. The KID
domain is highly conserved in the CREB family proteins, CREB, CREM and ATF-1
(Montminy, 1997). Each protein in this family has a serine phosphorylation site
(RRPS133) within the KID domain, which is recognised by protein kinase A (PK-A)
that phosphorylates S133. PK-A in turn is induced by outside stimuli that induce
intracellular accumulation of Ca2+ and cAMP. CREB binding activity is regulated
through S133 phosphorylation, which leads to interaction of CREB with CBP. The
KIX domain of CBP is required for interaction with the KID domain of CREB having
a phosphorylated S133 (see review Montminy, 1997). Interestingly, CBP possesses
intrinsic HAT activity (Bannister and Kouzarides, 1996; Ogryzko et al., 1996)
suggesting that recruitment of CBP to target promoters by the transcription activator
CREB may contribute to the transcriptional activation of CRE-dependent genes by the
involvement of histone acetylation at the genetic loci of target genes.



In Arabidopsis, a HAT gene encoding an ortholog of the yeast GCN5 was
found to bind in vitro to two proteins similar to the yeast HAT-adaptor protein
ADA2, namely ADA2a and ADA2b (Stockinger et al., 2001). Moreover, the transcription
activator CBF1 was found to bind to both HAT and ADA2, indicating that these
proteins might be recruited to target cold-inducible genes by binding to CBF1
(Stockinger et al., 2001). The finding that the Arabidopsis ADA2 and GCN5 genes
share similarity with their counterparts in yeast and humans suggests that chromatin
remodelling complexes are conserved even among evolutionarily distant organisms.
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Brassica napus L. cv Cascade (winter type), Westar (spring type) and DES010 
(spring type) were used for the isolation of genomic DNA and total RNA. Leaves,
flowers, stems, siliques and immature seeds were harvested from plants cultured in a
controlled-environment greenhouse programmed for a photoperiod of 16 h day and 8 h
night. Roots were obtained by culturing sterilized seeds in 0.8% agar plates containing
1/2 MS medium and 1% sucrose. For cold acclimation (4°C), abscisic acid (250 µM),
drought and high salt (850 mM NaCl) treatments, four-leaf stage seedlings were
treated and fourth fully expanded leaf blades were harvested as described by Gao et
al. (2002). LaCl3 and ionomycin treatments were carried out by watering four-leaf
stage plants with 20 mM LaCl3 and 10 µM ionomycin, respectively. Plants were
covered with Saran Wrap to slow evaporation.

Yeast two-hybrid screening and cloning

A yeast two-hybrid cDNA library (Dr. Isobel Parkin, Agriculture and
Agri-Food Canada Research Centre, Saskatoon) was constructed from poly(A)
mRNA isolated from the above-ground parts of the four-leaf stage seedlings of B.
napus L. cv. DH12075 and cloned into a GAL4 AD (activation domain) vector pPC86
using the Superscript Plasmid System for cDNA Synthesis and Plasmid Cloning
(Gibco BRL).

To generate the pDBLeu-HDA19 construct, the entire coding region of
Arabidopsis thaliana RPD3-type HDA19 cDNA (Accession # AF195547) was PCR
amplified using PWO DNA polymerase (Roche) with a forward primer:

5'-GCGTCGACGATGGATACTGGCGGCAATTCGC-3' (SEQ ID NO:44)

and a reverse primer:

5'-AGGCGGCCGCTTATGTTTTAGGAGGAAACGCC-3' (SEQ ID NO:45).
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The identity of the PCR product was confirmed by DNA sequence analysis and 
inserted into the SalI and NotI sites of the GAL4 DB (DNA binding domain) vector
pDBLeu in-frame with the GAL4 sequence and used as a bait to screen the B. napus
cDNA library using the PROQUEST Two-Hybrid System (Gibco BRL).

Approximately 1 x 10^6 transformants were subjected to the two-hybrid
selection on synthetic complete (SC) medium lacking leucine, tryptophan and
histidine but containing 15 mM 3-amino-1,2,4-triazole (3AT). The expression of the
HIS3 reporter gene allowed colonies to grow on the selective medium and the putative
His+ (3ATR) positive transformants were tested for the induction of the two other
reporter genes, URA3 and lacZ. The positive colonies were reassessed by
retransformation assays and the cloned cDNAs were identified by PCR and DNA
sequence analysis.

Southern blot analysis 

Total genomic DNA was isolated from the leaves of B. napus L. cv Westar
using a modified CTAB (cetyltrimethylammonium bromide) extraction method (Gao et
al., 2002). Briefly, 10 µg of total genomic DNA was digested with EcoRI, XbaI,
HindIII, PstI, EcoRV and KpnI restriction endonucleases, separated on a 0.8%
agarose gel, transferred to Hybond-XL membranes (Amersham Pharmacia) and
hybridized with the bnKCP1 open reading frame (ORF) labeled with [α-32P]dCTP
using a random primer labeling procedure. The DNA fragment to be used as a probe
was isolated from a 0.8% agarose gel and purified with a QIAquick Gel Extraction
Kit (Qiagen), and the probe was purified with a ProbeQuant G-50 Micro Column
(Amersham Pharmacia). Hybridization was performed under high stringency
conditions (Gao, M.-J. et al., 2002).

Northern blot analysis 

Total RNA was isolated from the tissues of B. napus L. cv DES010. These
included leaves and stems of four-leaf stage seedlings, flowers, immature siliques of
adult plants, and roots of cultured seedlings as described by Gao et al. (2001). Probe
labelling, hybridization, washing and membrane stripping were performed as
described above in the Southern blot analysis section.

Expression and purification of recombinant Gcn5 and HDA19

The full coding regions of the Arabidopsis HAT, Gcn5 (Dr. M. Thomashow,
Michigan State University, MI), and HDA19 (Accession # AF195547) were PCR
amplified, sequence analyzed and inserted in-frame with the GST (glutathione
S-transferase) into the SalI and NotI sites of vector pGEX-6P-2 (Amersham
Pharmacia). The forward primer used for the amplification of Gcn5 was:

5'-GCGTCGACGATGGACTCTCACTCTTCCCACC-3' (SEQ ID NO:46)

and the reverse primer for Gcn5 was:

5'-GCGCGGCCGCCTATTGAGATTTAGCACCAGA-3' (SEQ ID NO:47)

The forward primer for HDA19 was SEQ ID NO:44, as listed above, and the reverse
primer was:

5'-GCGCGGCCGCTTATGTTTTAGGAGGAAACGC-3' (SEQ ID NO:48).
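
The cloning sites built into the primers above can be confirmed with a short Python sketch. The primer sequences are transcribed from SEQ ID NOs:44 to 48 as printed; the recognition sequences used are those of SalI (GTCGAC) and NotI (GCGGCCGC), and the dictionary and function names are assumptions made for this illustration.

```python
# Recognition sequences of the two enzymes used for directional cloning.
RECOGNITION_SITES = {"SalI": "GTCGAC", "NotI": "GCGGCCGC"}

PRIMERS = {
    "SEQ ID NO:44": "GCGTCGACGATGGATACTGGCGGCAATTCGC",   # HDA19 forward
    "SEQ ID NO:45": "AGGCGGCCGCTTATGTTTTAGGAGGAAACGCC",  # HDA19 reverse (pDBLeu)
    "SEQ ID NO:46": "GCGTCGACGATGGACTCTCACTCTTCCCACC",   # Gcn5 forward
    "SEQ ID NO:47": "GCGCGGCCGCCTATTGAGATTTAGCACCAGA",   # Gcn5 reverse
    "SEQ ID NO:48": "GCGCGGCCGCTTATGTTTTAGGAGGAAACGC",   # HDA19 reverse (pGEX)
}


def sites_in(primer):
    """Return {enzyme: 0-based start} for each recognition site in the primer."""
    return {
        enzyme: primer.find(site)
        for enzyme, site in RECOGNITION_SITES.items()
        if site in primer
    }


SITE_MAP = {name: sites_in(seq) for name, seq in PRIMERS.items()}
```

Under these assumptions each forward primer carries a SalI site and each reverse primer a NotI site near its 5' end, which is consistent with insertion of the amplified fragments into the SalI and NotI sites of the vectors.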



Recombinant pGEX-6P-2 plasmids were used to transform E. coli
BL21-CodonPlus (DE3)-RP competent cells (Stratagene). Expression and purification
under non-denaturing conditions were carried out as described by Gao et al. (Gao,
M.-J. et al., 2002). The GST-Gcn5 and GST-HDA19 fusion proteins were analyzed
by 7.5% SDS-PAGE (SDS-polyacrylamide gel electrophoresis) and western blotting
with rabbit anti-GST-Pi polyclonal antibody (Chemicon) using the ECL Western
blotting analysis system (Amersham Pharmacia).



Generation of deletion mutants of bnKCP1

The two fragments, bnKCP1 1-160 and bnKCP1 1-80, and the entire coding
region of bnKCP1 DNA encoding amino acids 1-80, 1-160 and 1-215, respectively,
were amplified by PCR and cloned into the HindIII and XhoI sites of pET-28b vector
(Novagen, Madison, WI). The primers used for the amplification were as follows:

bnKCP1 1-160 (240bp): forward primer:
5'-GCAAGCTTATGGCAGGAGGAGGACCAACT-3' (SEQ ID NO:49),
reverse primer:
5'-CGCTCGAGCTCCTCCTCATCATTGTCTTC-3' (SEQ ID NO:50);

bnKCP1 1-80 (480bp): forward primer:
5'-GCAAGCTTATGGCAGGAGGAGGACCAACT-3' (SEQ ID NO:49),
reverse primer:
5'-CGCTCGAGATGAACAGGCAAAAGAGGCAT-3' (SEQ ID NO:51);

bnKCP1 (645bp): forward primer:
5'-GCAAGCTTATGGCAGGAGGAGGACCAACT-3' (SEQ ID NO:49),
reverse primer:
5'-CGCTCGAGCTCaTCTTCTTCTTCTTCTTC-3' (SEQ ID NO:52).

In vitro protein interaction assays 
Full-length bnKCP1 and truncated mutant bnKCP1 1-160 and bnKCP1 1-80
proteins labeled with [35S]methionine were produced using the TNT-Quick Coupled
Transcription/Translation System (Promega) according to the manufacturer's
instructions, with some modifications. A total of 1 µl of RNase inhibitor (Gibco
BRL) and 1 µl of protease inhibitors set (Roche) were added to the lysate reaction.
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After incubation fofWmin at 30°C, RNase A was added to tfie reaction to a final 
concentration of 0.2 mg/ml and incubated for 5 min at the same temperature. 

In vitro protein interaction was detected with GST pulldown affinity assays as
described by Ahmad et al. (1999) with some modifications. Briefly, 6 µg of GST or 4
µg of GST-fusion protein was incubated with 5 µl of [35S]Met-labeled translation
mixture in 200 µl of bead-binding buffer (50 mM K-phosphate, pH 7.6, 450 mM KCl,
10 mM MgCl2, 10% glycerol, 1% Triton X-100, 1% BSA and 1 µl of 1:12 diluted
protease inhibitors set) for 1 h at room temperature. After incubation, 20 µl of a 50%
slurry of glutathione-Sepharose beads containing 10 mg/ml of BSA and 4 µg of EtBr
was mixed with the reaction mixture followed by gentle rotation for 1 h at 4°C. After
washing six times with 1.2 ml of bead-binding buffer without BSA and EtBr but
containing 12 µl of protease inhibitors set (Roche), the bound proteins were eluted
with 30 µl of 2X SDS loading buffer, boiled for 2 min and analyzed by 12%
SDS-PAGE. After electrophoresis, the gels were dried, treated with Amplify
(Amersham Pharmacia) and subjected to fluorography.



In vivo protein assays 



The entire region of bnKCP1 and the two fragments, bnKCP1 1-160 and
bnKCP1 1-80, were PCR amplified and cloned into the SalI and NotI sites of pPC86
vector (Gibco BRL) in-frame with the GAL4 AD sequences to generate constructs
pPC86-bnKCP1, pPC86-bnKCP1 1-160 and pPC86-bnKCP1 1-80. The oligonucleotide
primers used in PCR amplification were as follows:

bnKCP1, bnKCP1 1-160 and bnKCP1 1-80 forward primer
5'-GCGTCGACGATGGCAGGAGGAGGACCAACT-3' (SEQ ID NO:53)

bnKCP1 reverse primer
5'-GCGCGGCCGCCTCATCTTCTTCTTCTTCCTC-3' (SEQ ID NO:54)

bnKCP1 1-160 reverse primer
5'-GCGCGGCCGCATGAACAGGCAAAAGAGGCAT-3' (SEQ ID NO:55)

bnKCP1 1-80 reverse primer
5'-GCGCGGCCGCCTCCTCCTCATCATTGTCTTC-3' (SEQ ID NO:56)

For in vivo protein interaction assays, the MaV203 yeast cells carrying the reporter
gene lacZ and the construct pDBLeu-HDA19, in which the HDA19 was fused in-frame
with GAL4 DB, were transfected with one of the plasmids pPC86-bnKCP1,
pPC86-bnKCP1 1-160 and pPC86-bnKCP1 1-80, or the vector alone. The expression of
the lacZ reporter gene was quantified by measuring the β-galactosidase activity using
chlorophenol red-β-D-galactopyranoside (CPRG) according to the manufacturer's
instructions (Gibco BRL). Two yeast control strains A and B (Gibco BRL) were
used as negative and positive controls, respectively.
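
How a CPRG reading might be converted into activity units can be sketched in Python as follows. The formula is a commonly used Miller-type normalisation, assumed here for illustration and not taken from this description; the exact wavelength, factors and procedure should be taken from the manufacturer's instructions. The function names and the numeric readings in the usage lines are hypothetical.

```python
def beta_gal_units(od_product, od_600, minutes, volume_ml):
    """Miller-type units: 1000 * OD(product) / (time * volume * OD600).

    od_product : absorbance of the chlorophenol red product (assumed reading)
    od_600     : cell density of the assayed culture
    minutes    : reaction time in minutes
    volume_ml  : culture volume assayed, in ml
    """
    if od_600 <= 0 or minutes <= 0 or volume_ml <= 0:
        raise ValueError("od_600, minutes and volume_ml must be positive")
    return 1000.0 * od_product / (minutes * volume_ml * od_600)


def fold_over_control(sample_units, control_units):
    """Express a sample's activity relative to the vector-alone control."""
    if control_units <= 0:
        raise ValueError("control_units must be positive")
    return sample_units / control_units


# Hypothetical readings, for illustration only.
sample = beta_gal_units(od_product=0.6, od_600=0.5, minutes=30, volume_ml=1.0)
vector = beta_gal_units(od_product=0.12, od_600=0.5, minutes=30, volume_ml=1.0)
relative_activity = fold_over_control(sample, vector)
```

Normalising to cell density, time and volume allows strains carrying different plasmids to be compared with the vector-alone and control strains on the same scale.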

Site-directed mutagenesis (SDM)

The QuikChange site-directed mutagenesis kit (Stratagene) was used to
replace the serine residue in the PK-A phosphorylation site (RRPS188) within the KID
domain with a glycine residue to generate bnKCP1G188 according to the
manufacturer's instructions. The two oligonucleotide primers used in SDM were as
follows:

forward primer:
5'-GATGTTCTTGCGAGGAGACCAGGATTCAAGAACAGAGCATTGAAG-3'
(SEQ ID NO:57)

reverse primer:
5'-CTTCAATGCTCTGTTCTTGAATCCTGGTCTCCTCGCAAGAACATC-3'
(SEQ ID NO:58)
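
The two mutagenic primers can be examined with a short Python sketch. The sequences are transcribed from SEQ ID NOs:57 and 58 as printed. The sketch assumes that the forward primer is read in frame from its first base; under that assumption the encoded peptide contains RRPG, which is consistent with the serine to glycine change described above. The helper names are assumptions made for this illustration.

```python
from itertools import product

# Standard genetic code, codons enumerated with bases in T, C, A, G order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}
COMPLEMENT = str.maketrans("ACGT", "TGCA")

SDM_FORWARD = "GATGTTCTTGCGAGGAGACCAGGATTCAAGAACAGAGCATTGAAG"  # SEQ ID NO:57
SDM_REVERSE = "CTTCAATGCTCTGTTCTTGAATCCTGGTCTCCTCGCAAGAACATC"  # SEQ ID NO:58


def reverse_complement(seq):
    """Return the reverse complement of an upper-case DNA sequence."""
    return seq.translate(COMPLEMENT)[::-1]


def translate(seq):
    """Translate complete codons of `seq`, starting at its first base."""
    usable = len(seq) - len(seq) % 3
    return "".join(CODON_TABLE[seq[i:i + 3]] for i in range(0, usable, 3))


# Mutagenic primer pairs of this type are fully complementary to each other.
PRIMERS_COMPLEMENTARY = reverse_complement(SDM_FORWARD) == SDM_REVERSE

# Peptide encoded by the forward primer in the assumed reading frame.
ENCODED_PEPTIDE = translate(SDM_FORWARD)
```

The complementarity check and the translated motif together give a quick sanity check on a primer pair before it is ordered.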

The introduced mutation was confirmed by DNA sequencing, and the mutated 
30 bnKCPl G m was cloned into the Hindm and Xhol sites of pET-28b vector to 
generate pET-bnKCPIG 188 , which was then used for in vitro protein interaction 
assays as described above. 



20 
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Transactivation assassins veast one-hvbrid system 

Effector plasmids pDBLeu-bnKCP1(1-160), pDBLeu-bnKCP1(1-80),
pDBLeu-bnKCP1(81-215) and pDBLeu-bnKCP1 were constructed by ligating the
PCR-amplified fragments ΔbnKCP1(1-160), ΔbnKCP1(1-80), ΔbnKCP1(81-215) and the
coding region of bnKCP1 into the SalI/NotI sites of the pDBLeu vector (Gibco
BRL) in-frame with the GAL4 DB sequence. The oligonucleotide primers for PCR
amplification were as follows:

bnKCP1 forward primer

5'-GCGTCGACGATGGCAGGAGGAGGACCAACT-3' (SEQ ID NO:53)

bnKCP1 reverse primer

5'-GCGCGGCCGCCTCATCTTCTTCTTCTTCCTC-3' (SEQ ID NO:54)

bnKCP1(1-160) forward primer

5'-GCGTCGACGATGGCAGGAGGAGGACCAACT-3' (SEQ ID NO:53)

bnKCP1(1-160) reverse primer

5'-GCGCGGCCGCATGAACAGGCAAAAGAGGCAT-3' (SEQ ID NO:55)

bnKCP1(1-80) forward primer

5'-GCGTCGACGATGGCAGGAGGAGGACCAACT-3' (SEQ ID NO:53)

bnKCP1(1-80) reverse primer

5'-GCGCGGCCGCCTCCTCCTCATCATTGTCTTC-3' (SEQ ID NO:56)

bnKCP1(81-215) forward primer

5'-GCGTCGACGCTAGGGTTGGCTTCATTGAGA-3' (SEQ ID NO:59)

bnKCP1(81-215) reverse primer

5'-GCGCGGCCGCCTCATCTTCTTCTTCTTCCTC-3' (SEQ ID NO:54)
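
The cloning primers carry the restriction sites used for ligation at their 5' ends. The sketch below, offered only as an illustration, scans a primer for the recognition sequences of the enzymes named in these examples; the function name `sites_in` is hypothetical, and the recognition sequences are the standard published ones.

```python
# Standard recognition sequences for the enzymes named in the cloning steps.
SITES = {
    "SalI": "GTCGAC",
    "NotI": "GCGGCCGC",
    "EcoRI": "GAATTC",
    "SacI": "GAGCTC",
    "HindIII": "AAGCTT",
    "XhoI": "CTCGAG",
}


def sites_in(primer):
    """Return the names of the enzymes whose site occurs in the primer."""
    primer = primer.upper()
    return sorted(name for name, site in SITES.items() if site in primer)


forward = "GCGTCGACGATGGCAGGAGGAGGACCAACT"    # SEQ ID NO:53
reverse = "GCGCGGCCGCCTCATCTTCTTCTTCTTCCTC"   # SEQ ID NO:54
print(sites_in(forward), sites_in(reverse))
```

For these two primers the scan reports a SalI site in the forward primer and a NotI site in the reverse primer, matching the SalI/NotI cloning into pDBLeu.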
The three reporter genes, lacZ, HIS3 and URA3, which were chromosomally
integrated in the genome of MaV203 yeast cells, were driven by unrelated
promoters containing GAL4 DNA binding sites (Gibco BRL). For transient assays,
the effector constructs or the negative control vector pDBLeu were transferred
to the MaV203 yeast cells. The β-galactosidase activity was measured using CPRG
(chlorophenol red-β-D-galactopyranoside) according to the manufacturer's
instructions (Gibco BRL). The MaV203 cells containing plasmids pDBLeu-HDA19 and
pPC86-bnKCP1 were used as the positive control. In addition, we used three yeast
control strains A, B, and C (Gibco BRL), which were developed to contain plasmid
pairs expressing fusion proteins with none, weak and moderately strong
protein-protein interaction strength, respectively.

Transient expression of the GUS-bnKCP1 fusion protein

The oligonucleotide primers for PCR amplification of the entire coding region
of bnKCP1 were as follows:

forward primer

5'-GCGAATTCATGGCAGGAGGAGGACCAACT-3' (SEQ ID NO:60)

reverse primer

5'-CGGAGCTCCTCATCTTCTTCTTCnn?CTTC-3' (SEQ ID NO:61)

The amplified sequence was cloned into the EcoRI and SacI sites of the binary
vector p79-637, a derivative of the vector CB301, to generate construct p77-132,
which contains the GUS-bnKCP1 fusion under control of the CaMV35S promoter. The
onion epidermal layers were transformed with Agrobacterium culture prepared as
described by Kapila et al. (1997) with a few modifications. Briefly, the onion
inner epidermal layers were peeled, placed into a culture of Agrobacterium
tumefaciens strain GV3101 pMP90 containing either p79-637, for GUS expression
only, or p77-132, and subjected to continuous vacuum of -85 kPa for 20 min.
After incubation at 22°C under 16h light condition for 7 days, the tissues were
placed into GUS staining solution [100 mM potassium phosphate buffer (pH 7.4),
1 mM EDTA, 0.5 mM K3Fe(CN)6, 0.5 mM K4Fe(CN)6, 0.1% Triton X-100, 1 mM
5-bromo-4-chloro-3-indolyl-β-D-glucuronide (X-gluc)], vacuum infiltrated for
20 min at -85 kPa and incubated overnight at 37°C. To determine the
intracellular location of nuclei, tissues were stained with the nucleus-specific
4',6-diamidino-2-phenylindole (DAPI) solution (14 μg/ml DAPI, 0.1 x PBS, 90%
glycerol) (Varagona et al., 1991) and viewed under a Zeiss microscope using both
fluorescence and bright-field optics.

Cloning of the B. napus KCP protein

To identify proteins that interact with HDA19 in B. napus, the ORF of
Arabidopsis HDA19 fused to the yeast Gal4 DNA binding domain was used as bait in
a yeast two-hybrid screening of a B. napus cDNA library linked to the yeast Gal4
activation domain. Several positive clones were obtained on the basis of the
induction of three yeast reporter genes, HIS3, URA3 and lacZ, and DNA sequence
analysis. One of these clones (963 bp), pPC86-bnKCP1, encodes a 23.5 kDa protein
that contains a putative kinase-inducible domain (KID)-like motif, and hence was
designated bnKCP1 (B. napus KID-containing protein 1).

Alignment of deduced amino acid sequences indicated that bnKCP1 shares 82%
amino acid identity with atKCP, an Arabidopsis unknown 26.6 kDa protein
(AY088175, At5g24890). It also shares high similarity in the conserved region of
approximately 55 amino acids (GKSKS domain) with two other atKCP-like
Arabidopsis unknown proteins, atKCL1 (CAB45910, At4g31510) and atKCL2
(AAD23890, At2g24550) (Figure 10A).

To estimate the bnKCP1 gene copy number in Brassica napus, we carried out
Southern blot analysis of total genomic DNA digested with restriction
endonucleases, using the entire open reading frame of bnKCP1 for probing under
high stringency conditions (Figure 12). Digestion with EcoRI (EI), HindIII (H),
PstI (P), EcoRV (EV) and KpnI, none of which has a cutting site within bnKCP1,
resulted in the detection of three bands, whereas digestion with XbaI generated
six bands because of the existence of an internal cutting site for XbaI in the
bnKCP1 gene. This result indicates that bnKCP1 belongs to a small gene family of
three members in the Brassica napus genome.
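
The reasoning behind the copy-number estimate can be written as a simple counting rule. The sketch below is a simplified model, not part of the described method: it assumes that every family member carries the same number of internal sites for the enzyme, that each resulting fragment overlaps the probe enough to hybridize, and that all fragments resolve as separate bands. The function name is hypothetical.

```python
def expected_bands(gene_copies, internal_sites):
    """Bands expected on a Southern blot under the simplifying assumptions.

    An enzyme that does not cut inside the probed gene leaves one hybridizing
    fragment per copy; each internal site splits that fragment once more.
    """
    return gene_copies * (internal_sites + 1)


# Three family members, enzyme without an internal site (e.g. EcoRI here).
print(expected_bands(3, 0))
# Three family members, enzyme with one internal site (XbaI here).
print(expected_bands(3, 1))
```

With three copies, the model gives three bands for the non-cutting enzymes and six for an enzyme with one internal site, which is the pattern reported for Figure 12.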



WO 03/104462 



PCT/CA03/00822 







Structural features of the bnKCP1 protein

The ORF of the bnKCP1 gene codes for a 215 amino acid polypeptide with several
functional motifs (Figure 11). Based on a search of protein localization sites
using the PSORT program (http://psort.nibb.ac.jp; Nakai and Kanehisa, 1992),
bnKCP1 appears to be a nuclear protein containing a pat7 nuclear localization
signal (NLS) PLNKKRR (SEQ ID NO: 62; Figure 10A, residues 127-133). Three acidic
motifs (I, II and III) and a serine-rich (S-rich) region (residues 34-58) may
function in transcription activation by bnKCP1 (Johnson et al., 1993). The
charged motif GKSKS (residues 88-143), which is conserved in all four protein
orthologs (Fig. 10A), is rich in basic residues and encompasses the NLS. This
suggests that this domain may serve the function of a DNA-binding motif
(Figure 11). In addition, bnKCP1 is extremely hydrophilic (Figure 11),
suggesting that bnKCP1 is an active element in the nuclear matrix.
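
The reported size of the polypeptide can be sanity-checked against its length. The sketch below uses the common rule of thumb of about 110 Da per amino acid residue; this average is an approximation introduced here for illustration and is not a value taken from the text, and the function name is hypothetical.

```python
AVERAGE_RESIDUE_MASS_DA = 110.0  # rule-of-thumb average, not from the source


def approx_mass_kda(n_residues):
    """Rough protein mass in kDa from residue count alone."""
    return n_residues * AVERAGE_RESIDUE_MASS_DA / 1000.0


estimate = approx_mass_kda(215)  # 215 residues, as stated for bnKCP1
print(round(estimate, 2))
```

The estimate of roughly 23.7 kDa is close to the 23.5 kDa reported for bnKCP1; an exact value would require the actual amino acid composition.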



Amino acid sequence analysis also revealed that bnKCP1 has a KID-like motif
(residues 161-215, Figure 10A) with alpha structure at its C-terminal region
(Figure 11). The KID is highly conserved in the mammalian CREB protein family
and functions in transactivation and protein binding (Montminy et al., 1997).
The KID in bnKCP1 has a high similarity to the CREB family member ATF-1
(Figure 10B, C) and contains a protein kinase A (PK-A) phosphorylation site
(RRPS) that is conserved in the CREB family of proteins (Figure 10B).

Interaction of bnKCP1 with HDA19 and Gcn5



To confirm the interaction detected in the yeast two-hybrid system between
the bnKCP1 protein and HDA19, GST pulldown assays were performed using in vitro
translated bnKCP1 labeled with [35S]methionine. The bnKCP1 protein was tested
for its ability to interact with recombinant GST-HDA19 or GST-Gcn5 fusions
expressed in E. coli.



As shown in Figure 13B, bnKCP1 bound to both GST-HDA19 and GST-Gcn5 fusion
proteins, but not to GST alone. To reassess the interaction of bnKCP1 with Gcn5
in vivo, the ORF of the Arabidopsis Gcn5 was fused to the yeast Gal4 DNA binding
domain in the pDBLeu vector and then used to transform yeast MaV203 cells
expressing bnKCP1 fused to the yeast Gal4 activation domain in the pPC86 vector.
The transformants showed induction of the three reporter genes, HIS3, URA3 and
lacZ, at a relatively lower level when compared with the induction levels in
transformants with bnKCP1 and HDA19 (data not shown). This result suggests that
bnKCP1 has a preference for binding to HDA19 in vivo.



To map the protein binding domain of the bnKCP1 protein, two C-terminal
truncated mutants of bnKCP1 lacking the KID domain were constructed. These are
ΔbnKCP1(1-160) (residues 1-160) and ΔbnKCP1(1-80) (residues 1-80), as shown in
Figure 13A. These truncated mutants were assayed for in vitro interaction with
the recombinant GST-HDA19 or GST-Gcn5 fusion proteins. The two mutant proteins,
ΔbnKCP1(1-160) and ΔbnKCP1(1-80), exhibited no interaction with either GST-HDA19
or GST-Gcn5, indicating that the KID domain of the bnKCP1 protein is essential
for binding to HDA19 and Gcn5.

The importance of the KID domain for protein binding was also determined in
vivo using the yeast two-hybrid system. MaV203 yeast cells were co-transformed
with pDBLeu-HDA19 and either pPC86-bnKCP1, pPC86-bnKCP1(1-160),
pPC86-bnKCP1(1-80) or pPC86 alone (Figure 13C). β-galactosidase activity was
reduced by at least 50% when cells carrying pDBLeu-HDA19 were transformed with
plasmids expressing either ΔbnKCP1(1-160) or ΔbnKCP1(1-80), both of which lack
the KID, as compared to the full-length bnKCP1. This finding demonstrates that
the KID is critical for bnKCP1 interaction with HDA19 in vivo.

To investigate the importance of S188 for bnKCP1 interaction with HDA19, the
S188 residue in bnKCP1 was mutated to G188 using site-directed mutagenesis to
obtain the bnKCP1(G188) protein (Fig 14). This mutated protein was then tested
for binding to HDA19 in vitro. When compared to bnKCP1, the mutated protein,
bnKCP1(G188), has significantly reduced binding to HDA19 (Fig 14). This confirms
that S188 is essential for optimal interaction between bnKCP1 and HDA19.

Expression pattern of the bnKCP1 gene
The expression pattern of the bnKCP1 gene was analyzed by Northern blot
analysis of total RNA extracted from various organs of B. napus. As shown in
Figure 15, two transcripts of similar sizes appear to hybridize to bnKCP1,
indicating the existence of two homologs of bnKCP1 mRNAs in B. napus. These
transcripts accumulated at high levels in flowers, roots, stems and immature
siliques, and at low levels in leaves with petioles, but were undetectable in
leaf blades (Figures 15, 16).

To investigate the pattern of bnKCP1 expression in response to environmental
stress conditions, total RNA was isolated from leaf blades of four-leaf stage
B. napus seedlings that were exposed to low temperature (4°C), drought, high
salt (NaCl), and ABA treatment, and used for northern blot analysis using a
bnKCP1 probe. Transcripts of both bnKCP1 homologs accumulated in leaves in
response to cold treatment. The smaller (~0.9 kb) transcript appears to be
induced within 4h of cold treatment and about 4h earlier than the larger
(1.1 kb) one (Figure 16A). The bnKCP1 transcript appears to accumulate in
response to low temperature (4°C), but expression was not detected in leaf
blades of plants grown under drought conditions for up to 4 days, high salt
stress for up to 11 days, or upon exogenous application of ABA for up to 8 hours
(data not shown). Expression of bnKCP1 in the stems was repressed upon cold
treatment (Figure 16A), suggesting that the response of the bnKCP1 transcript to
low temperature, or the recruitment of HDA19 and HAT to the promoters of cold
responsive genes, is organ specific.

Since cold acclimation is known to be associated with elevated intracellular
concentrations of Ca2+, tests to determine whether Ca2+ has any effect on bnKCP1
expression were performed. Northern blot analysis was performed using total RNA
isolated from leaves of seedlings treated with the Ca2+ channel blocker LaCl3
and the Ca2+ ionophore ionomycin at room temperature. Induction of bnKCP1
expression upon treatment with ionomycin was rapid (2 hrs) but short-lived. The
bnKCP1 transcript was undetectable in leaves of seedlings treated with LaCl3
(Figure 16B).

Transcription activation by bnKCP1
To determine whether bnKCP1 functions as a transcription activator,
transactivation experiments were carried out in yeast. A yeast strain carrying
three reporter genes, lacZ, HIS3 and URA3, driven by promoters fused to GAL4 DNA
binding sites and independently integrated into the yeast genome, was
transformed with the effector plasmid pDBLeu-bnKCP1, comprising bnKCP1 fused to
the GAL4 DB under the control of the ADH promoter. The effector stimulated
β-galactosidase activity about 8-fold relative to either GAL4 DB alone or yeast
control strain A, which contains plasmid pairs expressing fusion proteins
without protein-protein interaction. A similar result was obtained when the
yeast cells were co-transformed with the positive control plasmids pDBLeu-HDA19
and pPC86-bnKCP1 identified by the two-hybrid selection (Figure 17A). Reporter
genes HIS3 and URA3 were also modestly activated by bnKCP1 (data not shown).
Based on these findings, it can be concluded that bnKCP1 exerts transactivation
of target genes in Brassica napus.

These data demonstrate the isolation of a plant protein that contains a
putative KID domain, which interacts with both GCN5 (HAT) and HDA19. bnKCP1 was
highly expressed in all organs tested, except leaf blades, where it was induced
in response to cold acclimation, which also resulted in repressing its
expression in stems. Furthermore, bnKCP1 exerts transcription activation of a
reporter gene when tested in yeast, indicating the function of bnKCP1 as a
transcription factor.

To map the transactivation domain of the bnKCP1 protein, one N-terminal
truncated mutant of bnKCP1, ΔbnKCP1(81-215), and two C-terminal truncated
mutants, ΔbnKCP1(1-160) and ΔbnKCP1(1-80) (Fig 17B), were generated and used in
in vivo transactivation assays in yeast. As shown in Fig 17C, deletion of the
KID or GKSKS domains had no significant influence on β-galactosidase activity,
whereas deletion of the N-terminus resulted in approximately 65% reduction in
β-galactosidase activity.
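
The fold-activation and percent-reduction figures quoted for the reporter assays are simple ratios of β-galactosidase activities. The sketch below shows the arithmetic only; the activity values are hypothetical numbers in arbitrary units, chosen so that they reproduce the reported ratios, and are not measurements from these experiments. The function names are likewise illustrative.

```python
def fold_activation(sample, control):
    """Reporter activity of a sample relative to a control."""
    return sample / control


def percent_reduction(mutant, full_length):
    """Percent loss of activity in a mutant relative to the full-length protein."""
    return 100.0 * (1.0 - mutant / full_length)


# Hypothetical activities (arbitrary units), for illustration only.
vector_only = 1.0
full_length = 8.0
n_terminal_deletion = 2.8

print(fold_activation(full_length, vector_only))
print(round(percent_reduction(n_terminal_deletion, full_length), 1))
```

With these illustrative inputs the full-length effector shows 8-fold activation over the vector control, and the N-terminal deletion shows a 65% reduction relative to the full-length protein.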






Nuclear localization of the bnKCP1 protein

Structural and functional analyses showed bnKCP1 to have features typical of
transcription factors. To confirm that bnKCP1 is a nuclear protein, onion
epidermal cell layers were transformed with constructs for the expression of
either a GUS-bnKCP1 fusion or GUS alone (Fig 18) using an
Agrobacterium-mediated transformation method (Kapila et al., 1997). As shown in
Figure 18, GUS activity was visualized exclusively in the cytoplasm of control
onion cell layers. In contrast, a blue precipitate was localized in the nuclei
of cell layers transformed with the GUS-bnKCP1 fusion construct, although there
was still a certain amount of cytoplasmic staining, indicating that at least
some targeting to the nucleus occurs with the fusion protein.

Expression of bnKCP1 is organ-specific

The bnKCP1 gene appears to be part of a multigene family of three members
based on Southern blot hybridization. Northern blot analyses showed that two
members of this gene family are of similar transcript sizes and expression
patterns. This is consistent with information about bnKCP1 orthologs in
Arabidopsis, where there are one atKCP (At5g24890) and two atKCP-like members
(At4g31510 and At2g24550) of similar sizes ranging from 1 kb to 1.2 kb. Northern
blot analysis revealed that bnKCP1 mRNA was expressed in flowers, roots, stems
and immature siliques (Figure 14). The transcript, however, was undetectable in
leaf blades of B. napus seedlings, suggesting tissue/organ-specific expression
of the bnKCP1 gene. Cold treatment induced bnKCP1 expression in leaves, but
repressed it in stems.

The KID domain is conserved in bnKCP1

Structural analysis of the bnKCP1 protein revealed that it was a strongly
hydrophilic protein (23.5 kDa, pI 4.2) and had characteristic features of a
transcription factor, including a putative nuclear localization signal (NLS), a
putative basic DNA binding domain, putative acidic activation domains and a
protein-protein interaction domain.

An important structural feature of bnKCP1 is the presence of a putative
kinase-inducible domain (KID) with alpha secondary structure at the C-terminal
region. The KID domain was first identified in mammalian CREB family members
CREB, CREM and ATF-1. The KID domain in mammalian CREB is involved in at least
two functions, interaction with CBP/p300 and the site for protein kinase A
(PK-A) phosphorylation of S133 (Montminy et al., 1997; Gonzalez et al., 1991;
Quinn, 1993; Chrivia et al., 1993; Shaywitz et al., 2000). Similar to its
counterpart in CREB, which is involved in protein binding, the KID domain of
bnKCP1 is required for binding to both HDA19 and GCN5 in vitro and in vivo. The
ability of bnKCP1 to interact with HDA19 indicates that bnKCP1-mediated
transcription control requires direct or indirect recruitment of these
transcription regulators to promoter regions of target genes regulated by
bnKCP1.

Phosphorylation of CREB at Ser133 is required for the interaction of CREB via
its KID with CBP and for CREB to activate transcription in response to some
extracellular stimuli (Gonzalez et al., 1989; Chrivia et al., 1993). The KID
domain in bnKCP1 also contains a putative PK-A phosphorylation site (RRPS188),
which corresponds to the RRPS133 in mammalian CREB.

Intracellular level of Ca2+ affects bnKCP1 expression

In mammalian cells, outside stimuli that increase intracellular concentrations
of Ca2+ or cAMP induce the expression of not only PK-A, but also the CREB gene
(Meyer et al., 1993). Therefore, tests to determine whether conditions that
increase intracellular concentrations of Ca2+ would induce bnKCP1 expression
were done. B. napus seedlings were subjected to one of two treatments, cold or
ionomycin. Cold acclimation is known to increase intracellular Ca2+
concentrations (Monroy and Dhindsa, 1995; Knight et al., 1996), and ionomycin is
a known calcium ionophore that increases Ca2+ influx (Hurley et al., 1996).
These treatments resulted in the induction of bnKCP1 expression to varying
degrees (Figure 16), which indicated that bnKCP1 is induced by high
intracellular Ca2+ concentrations.

These results suggest a molecular mechanism by which bnKCP1 functions as
a transcription factor to regulate gene expression by recruiting HDAC to the
promoter regions of target genes.
Example 5: Characterization of the recruitment factor SCL1 and its interaction
with the chromatin remodelling factor HDA19 

To search for additional transcription factors that recruit histone deacetylase
(HDAC) to target promoters in Brassica napus, a yeast two-hybrid library was
screened using the Arabidopsis thaliana HDA19 as bait. This screening resulted
in the isolation of a cDNA clone that encodes a SCARECROW-like protein, BnSCL1,
which contains a number of putative functional motifs typical of the GRAS family
of transcription factors. Southern blot analysis indicated that the BnSCL1 gene
belongs to a small gene family of about three members. In vitro and in vivo
protein interaction assays revealed that BnSCL1 interacts physically with HDA19
through the VHIID domain. BnSCL1 also exerted strong transactivation of the lacZ
reporter gene in yeast, and both N- and C-terminal regions are critical for the
transient expression. Quantitative RT-PCR and RNA gel blot analysis showed that
BnSCL1 was expressed at a relatively high level in roots, a moderate level in
flowers, a weak level in mature leaves and stems, and was barely detectable in
immature siliques. The accumulation of BnSCL1 transcript was regulated by 2,4-D
in shoots, roots and mature leaves. Furthermore, the response of BnSCL1 to 2,4-D
was modulated by histone deacetylase HDA19. These results strongly suggest a
molecular mechanism by which BnSCL1 functions as a transcription factor to
regulate gene expression by recruiting HDAC to the promoter regions of
auxin-responsive genes.

Plant materials

Brassica napus L. cv. DH12075 was used for DNA and total RNA isolation.
Leaves, flowers, stems, siliques and immature seeds were harvested from plants
cultured in a controlled-environment greenhouse programmed for a photoperiod of
16h day and 8h night. Roots were obtained by culturing sterilized seeds in 0.8%
agar plates containing 1/2 MS medium (Murashige and Skoog, 1962) and 1% sucrose.

Tissue treatment 

In exogenously applied auxin treatments, four-leaf stage seedlings grown at
20°C were treated with a foliar spray containing 1 mM 2,4-D and 50 mM sodium
phosphate, pH 7.5. The four leaves were collected at 30 min, 60 min and 180 min
after the first foliar application of 2,4-D. For the measurement of the response
of shoots and roots to auxin, sterilized seeds were germinated on plates in a
growth chamber with continuous light at 20°C, and 10 dpg seedlings were supplied
with varied concentrations of 2,4-D. In the auxin transport inhibition
experiments, 9 dpg seedlings were incubated in medium supplemented with 50 μM
NPA dissolved in 0.1% DMSO for 24 h before the 2,4-D treatment. For the HDAC
inhibitor treatments, 10 mM sodium butyrate was added to the growth medium and
incubated for 24 h, followed by exogenous 2,4-D application at varied
concentrations.

Yeast two-hybrid screening and cloning

A yeast two-hybrid cDNA library was constructed from seedlings of B. napus
L. cv. DH12075 and screened using an Arabidopsis thaliana RPD3-type HDAC
(HDA19) as bait, with the methods of the ProQuest Two-Hybrid System (Gibco BRL)
as previously described by Gao et al. (2003). The positive colonies were
reassessed with retransformation experiments and confirmed with in vitro protein
interaction assays, and the cloned cDNAs were identified by PCR and DNA sequence
analysis.

Gel Blot Analysis 

Total genomic DNA was extracted from the leaves of four-leaf stage B. napus
using a modified CTAB (cetyltrimethylammonium bromide) extraction method, and
DNA gel blots were prepared and hybridized with the BnSCL1 open reading frame
labeled with [α-32P]dCTP using the random primer labeling procedure as described
by Gao et al. (2003). Total RNA was isolated using the hot phenol method with
the first extraction for 30 sec at 80°C as previously described (Gao et al.,
2002). RNA was isolated from various tissues, including leaves and stems of
four-leaf stage seedlings, flowers, immature seeds and siliques of adult plants,
and roots of cultured seedlings.



Quantitative RT-PCR 

Total RNA extracted as described above was treated with Amplification Grade
Deoxyribonuclease I (Gibco BRL) following the manufacturer's instructions. The
RNA samples were then used directly for reverse transcription prior to
amplification, without purification. Quantitative RT-PCR was completed in a
one-step reaction using the SuperScript One-Step RT-PCR System (Gibco BRL) as
described by Gao et al. (2002). Gene-specific sense and anti-sense primers used
to generate a 960 bp fragment of Brassica napus Actin, as an internal standard,
were as described in Gao et al. (2002). Gene-specific primers for the generation
of BnSCL1, BnIAA1 and BnIAA12 fragments were as follows:



BnSCL1 (435 bp)

sense: 5'-GATGGACGAACATGCCATGCGTTCCA-3' (SEQ ID NO:84)
anti-sense: 5'-CGCTCGGATCTTCTGAACAAT-3' (SEQ ID NO:85)

BnIAA1 (537 bp)

sense: 5'-CCACGCGTCCGGTACGATGAT-3' (SEQ ID NO:86)
anti-sense: 5'-GAAGTTGAGAAATGGTTTATGA-3' (SEQ ID NO:87)

BnIAA12 (659 bp)

sense: 5'-ACGCTGGTGCTTCTCCTCCTC-3' (SEQ ID NO:88)
anti-sense: 5'-AAAACCCATTAGAAGAACCAAGAA-3' (SEQ ID NO:89)

BnIAA1 and BnIAA12 are clones ML2798 and ML4744, which are homologs of
Arabidopsis IAA1 and IAA12, respectively, and were identified in a database of
Brassica napus ESTs that were generated at the Saskatoon Research Centre of
Agriculture and Agri-Food Canada (www.brassica.ca).

Expression and purification of recombinant HDA19 

The open reading frame (ORF) of HDA19 was PCR amplified, sequence analyzed,
inserted in-frame with the GST (glutathione S-transferase) into the vector
pGEX-6P-2 (Amersham Pharmacia), and transformed into E. coli BL21-CodonPlus
(DE3)-RP competent cells (Stratagene) as previously described (Gao et al.,
2003). The recombinant HDA19 protein was expressed and purified under
non-denaturing conditions as described by Gao et al. (2002). The GST-HDA19
fusion protein was analyzed by western blotting with rabbit anti-GST-Pi
polyclonal antibody (Chemicon) using the ECL Western blotting analysis system
(Amersham Pharmacia).



In vitro protein interaction assays 

The entire coding region of BnSCL1 and four fragments, BnSCL1(1-358),
BnSCL1(1-261), BnSCL1(1-217) and BnSCL1(1-145), encoding amino acids 1-434,
1-358, 1-261, 1-217 and 1-145, respectively, were amplified by PCR and cloned
into the HindIII and XhoI sites of the expression vector pET-28b (Novagen)
in-frame with the His-Tag sequence. The primers used for amplification were as
follows:

Forward primer for BnSCL1, BnSCL1(1-358), BnSCL1(1-261), BnSCL1(1-217) and
BnSCL1(1-145):
5'-GCAAGCTTATGGACGAACATGCCATGCGTTCCA-3' (SEQ ID NO:90)

Reverse primer for BnSCL1:
5'-CGCTCGAGAAAGCGCCACGCTGACGTGGC-3' (SEQ ID NO:91)

Reverse primer for BnSCL1(1-358):
5'-CGCTCGAGCGCGGAGATCTTCGGACGTAA-3' (SEQ ID NO:92)

Reverse primer for BnSCL1(1-261):
5'-CGCTCGAGCCTAATCGCCTTGAAAGATAA-3' (SEQ ID NO:93)

Reverse primer for BnSCL1(1-217):
5'-CGCTCGAGCGCCACAACCGCCGTGACTCT-3' (SEQ ID NO:94)

Reverse primer for BnSCL1(1-145):
5'-CGCTCGAGCGCTCGGATCTTCTGAACAAT-3' (SEQ ID NO:95)



The TNT-Quick Coupled Transcription/Translation System (Promega) was used to
produce the full-length BnSCL1 protein and the truncated mutants
ΔBnSCL1(1-358), ΔBnSCL1(1-261), ΔBnSCL1(1-217) and ΔBnSCL1(1-145) labeled with
[35S]methionine as previously described (Gao et al., 2003). In vitro protein
interaction was detected with GST pulldown affinity assays as described by Ahmad
et al. (1999) and Gao et al. (2003).



In vivo protein interaction assays

The six DNA fragments, BnSCL1(1-358), BnSCL1(1-261), BnSCL1(1-217),
BnSCL1(1-145), BnSCL1(146-358) and BnSCL1(218-434), and the ORF of BnSCL1,
encoding amino acids 1-358, 1-261, 1-217, 1-145, 146-358, 218-434 and 1-434,
respectively, were PCR amplified and cloned into the SalI and NotI sites of the
pPC86 vector (Gibco BRL) in-frame with the GAL4 AD sequence to generate
constructs pPC86-BnSCL1(1-358), pPC86-BnSCL1(1-261), pPC86-BnSCL1(1-217),
pPC86-BnSCL1(1-145), pPC86-BnSCL1(146-358), pPC86-BnSCL1(218-434) and
pPC86-BnSCL1. PCR amplification was carried out using the following primers:

Forward primer for BnSCL1, BnSCL1(1-358), BnSCL1(1-261), BnSCL1(1-217) and
BnSCL1(1-145):
5'-GCGTCGACGATGGACGAACATGCCATGCGTTCCA-3' (SEQ ID NO:96)

Forward primer for BnSCL1(146-358):
5'-GCGTCGACGATTAAGGAGTTTTCCGGTATA-3' (SEQ ID NO:97)

Forward primer for BnSCL1(218-434):
5'-GCGTCGACGGAGGATTGCGCCGTCGAGACG-3' (SEQ ID NO:98)

Reverse primer for BnSCL1 and BnSCL1(218-434):
5'-GCGCGGCCGCAAAGCGCCACGGTGACGTGGC-3' (SEQ ID NO:99)

Reverse primer for BnSCL1(1-358):
5'-GCGCGGCCGCCGCGGAGATCTTCGGACGTAA-3' (SEQ ID NO:100)

Reverse primer for BnSCL1(1-261):
5'-GCGCGGCCGCCCTAATCGCCTTGAAAGATAA-3' (SEQ ID NO:101)

Reverse primer for BnSCL1(1-217):
5'-GCGCGGCCGCCGCCACAACCGCCGTGACTCT-3' (SEQ ID NO:102)

Reverse primer for BnSCL1(1-145):
5'-GCGCGGCCGCCGCTCGGATCTTCTGAACAAT-3' (SEQ ID NO:103)

Reverse primer for BnSCL1(146-358):
5'-GCGCGGCCGCCGCGGAGATCTTCGGACGTAA-3' (SEQ ID NO:100)

For in vivo protein interaction assays, the MaV203 yeast competent cells
carrying the lacZ reporter gene were co-transformed with the construct
pDBLeu-HDA19, in which HDA19 was fused in-frame with the GAL4 DB, and one of
the plasmids pPC86-BnSCL1, pPC86-BnSCL1(1-358), pPC86-BnSCL1(1-261),
pPC86-BnSCL1(1-217), pPC86-BnSCL1(1-145), pPC86-BnSCL1(146-358) and
pPC86-BnSCL1(218-434), or the vector pPC86 alone. The expression of the lacZ
reporter gene was quantified by measuring the β-galactosidase activity using
CPRG (chlorophenol red-β-D-galactopyranoside) according to the manufacturer's
instructions (Gibco BRL). Three yeast control strains A, B, and C (Gibco BRL)
that contain plasmid pairs expressing fusion proteins with none, weak and
moderately strong interaction strengths, respectively, were used as controls.

Transactivation assay

MaV203 yeast cells expressing the lacZ reporter gene driven by a promoter
containing GAL4 DNA binding sites (Gibco BRL) were transformed with the
pDBLeu-bnKCP1(1-160), pDBLeu-bnKCP1(1-80), pDBLeu-bnKCP1(81-215) and
pDBLeu-bnKCP1. These vectors were constructed by ligating the PCR-amplified
fragments, ΔBnSCL1(1-358), ΔBnSCL1(1-261), ΔBnSCL1(1-217), ΔBnSCL1(1-145),
ΔBnSCL1(146-358) and ΔBnSCL1(218-434) and the entire coding region of BnSCL1,
respectively, into the SalI and NotI sites of the vector pDBLeu (Gibco BRL)
in-frame with the GAL4 DB sequence. The oligonucleotide primers for the
amplification were the same as those used for the in vivo protein interaction
assays. The β-galactosidase activity was measured using CPRG according to the
manufacturer's instructions (Gibco BRL). In addition to the yeast strains A, B
and C, the yeast strain D (Gibco BRL), which contains a plasmid pair expressing
fusion proteins with strong interaction strength, was used as a control.
Cloning and Sequence Analysis of the BnSCL1 Gene

HDAC or HAT is recruited to specific loci by large protein complexes made up of
transcription activators/co-activators and repressors/co-repressors,
respectively (see reviews Kuo and Allis, 1998; Meyer, 2001). Identification of
these transcription regulatory proteins that interact with HDAC or HAT is a
direct approach to defining nuclear factors that recruit these chromatin
remodelling regulators to their target promoters and hence affect the expression
of the target genes. To isolate proteins that bind to HDAC in B. napus, the ORF
of Arabidopsis thaliana HDA19 fused to the yeast Gal4 DNA binding domain was
used as bait in a yeast two-hybrid screening of a B. napus cDNA library linked
to the yeast Gal4 activation domain. A number of positive clones were obtained
on the basis of the induction of three yeast reporter genes, HIS3, URA3 and
lacZ, followed by retransformation and sequencing analysis. One of these clones
encodes a 51.2 kDa protein with pI 5.1, designated BnSCL1 (Brassica napus
SCARECROW-like protein 1; SEQ ID NO:81). As shown in Figure 20, BnSCL1 contains
several domains of the SCARECROW (SCR) family of transcription factors
(Laurenzio et al., 1996).

Sequence analysis revealed that the BnSCL1 cDNA (2781 bp) contains two open
reading frames (ORFs). The first ORF (ORF1) encodes BnSCL1, a polypeptide of
461 amino acid residues starting at 82 bp from the 5' end, and ORF2 codes for a
polypeptide of 281 amino acids starting at 1687 bp from the 5' end. The linking
region of the two ORFs is a short sequence of 200 bp. A database search using the NCBI
BLAST program (Altschul et al., 1997) indicated that the deduced amino acid sequence
encoded by ORF2 was similar to the human polyposis coli region hypothetical protein
DP1 (accession number A39658), which contains a TB2_DP1_HVA22 domain.
However, the GENSCAN program (Burge and Karlin, 1997) predicts that the 2781
bp of BnSCL1 cDNA encode one polypeptide only, i.e. the deduced amino acid
sequence of ORF1.
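
The ORF coordinates quoted above can be cross-checked with simple arithmetic. The following is a minimal sketch, not part of the described analysis: it assumes 1-based coordinates, that each ORF runs uninterrupted from its stated start position, and that a stop codon immediately follows the last residue. The helper name `orf_span` is hypothetical.

```python
from typing import Tuple

CDNA_LENGTH_BP = 2781  # BnSCL1 cDNA length stated in the text


def orf_span(start_bp: int, n_residues: int, include_stop: bool = True) -> Tuple[int, int]:
    """Return (start, end) in 1-based inclusive bp for an uninterrupted ORF."""
    codons = n_residues + (1 if include_stop else 0)
    return start_bp, start_bp + 3 * codons - 1


orf1 = orf_span(82, 461)    # ORF1: 461 residues starting at bp 82
orf2 = orf_span(1687, 281)  # ORF2: 281 residues starting at bp 1687
linker_bp = orf2[0] - orf1[1] - 1  # bases between end of ORF1 and start of ORF2

print("ORF1 span:", orf1)
print("ORF2 span:", orf2)
print("linker length (bp):", linker_bp)
print("ORF2 fits within cDNA:", orf2[1] <= CDNA_LENGTH_BP)
```

Under these assumptions both ORFs fit within the 2781 bp cDNA, and the computed gap is of the same order as the 200 bp linker stated in the text; the exact value depends on where the ORF boundaries are drawn.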

Comparison of the deduced BnSCL1 amino acid sequence to the NCBI
(http://www.ncbi.nlm.nih.gov) and TAIR (arabidopsis.org) databases results in a list
of proteins with considerable similarity (Fig. 21). According to the NTI computer
program (InforMax, Inc.), BnSCL1 shares 89% amino acid identity with AtSCL15
(Pysh et al., 1999) or VHS5 (Silverstone et al., 1998), an Arabidopsis SCARECROW-like
protein (accession number Z99708, At4g36710), while it is 37% identical to
AtSCR (accession number U62797). Interestingly, it also shares high similarity
(66% sequence identity) with a tomato (Lycopersicon esculentum) protein (accession
number AF273333), a member of the GRAS/VHIID protein family, encoded by the
Lateral suppressor gene (Ls) (Schumacher et al., 1999) (Fig. 20). Consistent with
these data, phylogenetic analysis using either the NTI Vector or DNA Star program
classified BnSCL1, AtSCL15 and LsSCL (Ls) in the same subgroup (Fig. 21).

The BnSCL1 copy number in B. napus was estimated using DNA gel blot
analysis on total genomic DNA digested with restriction endonucleases and
hybridized with the ORF of BnSCL1 under high stringency conditions (Fig. 22).
Digestion with EcoRI, XbaI, HindIII, PstI and KpnI resulted in the detection of about
three bands, whereas digestion with EcoRV generated approximately six bands due to
the existence of an internal cutting site for EcoRV within the BnSCL1 gene. This
result indicates that BnSCL1 belongs to a small gene family of approximately three
members in the B. napus genome.

BnSCL1 is a Member of the GRAS/VHIID Family

The BnSCL1 gene encodes a polypeptide of 461 amino acids with several
suggestive functional domains or motifs (Fig. 20). It has two MAT α2-like nuclear
localization signals (NLSs) (residues 169-173 and 436-440) (Raikhel, 1992). It also
has an LXXLL motif (residues 148-152, LGSLL (SEQ ID NO: 104)) that was shown to mediate
interaction of transcription coactivators with nuclear receptors (Heery et al., 1997).
Amino acid sequence analysis also revealed that BnSCL1 has the characteristic
structure of GRAS/VHIID regulatory proteins (Pysh et al., 1999), including a VHIID
motif that encompasses a putative NLS, two leucine heptad repeats (LHRs) that
surround the conserved VHIID motif, a PFYRE motif and a C-terminal SAW motif
that encompasses a putative NLS (Fig. 20). The LHRI-VHIID-LHRII region has been
thought to function in protein-protein and DNA-protein interactions (Pysh et al.,
1999).

BnSCL1 Interacts Physically with HDA19 in vitro and in vivo



WO 03/104462 PCT/CA03/00822 


To confirm the interaction of BnSCL1 protein with HDA19 that was detected
in the yeast two-hybrid system, GST pulldown affinity assays were carried out using
in vitro-translated BnSCL1 labeled with [35S]methionine. The BnSCL1 protein was
tested for its ability to bind a GST-HDA19 fusion protein that was expressed in
Escherichia coli and purified under non-denaturing conditions. As shown in Figure 23,
BnSCL1 bound to recombinant HDA19 protein, while it did not bind to GST alone
(data not shown).



To map the protein binding domain of the BnSCL1 protein, four C-terminal
truncated mutants of BnSCL1 lacking the SAW, PFYRE, LHRII or VHIID
motif (Fig. 23a) were constructed. These truncated mutants were assayed for in vitro
interaction with the recombinant HDA19 protein. As shown in Figure 23b, the mutant
proteins retained interaction with the GST-HDA19 fusion protein as the truncation
proceeded from the C-terminal end, until the VHIID region was deleted, indicating that
the VHIID domain is essential for BnSCL1 protein binding to HDA19.



The requirement of the VHIID domain for protein-protein interaction was also
demonstrated in vivo using the yeast two-hybrid system (Fig. 24). MaV203 yeast
cells were co-transformed with plasmid pDBLeu-HDA19 and either pPC86-BnSCL1,
pPC86-BnSCL1(1-358), pPC86-BnSCL1(1-261), pPC86-BnSCL1(1-217), pPC86-BnSCL1(1-145),
pPC86-BnSCL1(146-358), pPC86-BnSCL1(218-438) or the vector pPC86 alone.
Although β-galactosidase activity was reduced by at least 50% when pDBLeu-HDA19-
expressing cells were transformed with plasmids expressing any of the six mutants
of BnSCL1 protein, as compared to the wild type BnSCL1, the transformants with
plasmids expressing either pPC86-BnSCL1(1-145) or pPC86-BnSCL1(218-438), both of
which lacked the VHIID motif, showed a further reduction of at least 50% in
β-galactosidase activity as compared to the other mutants. This finding indicates that
the VHIID domain is critical for BnSCL1 interaction with HDA19 in vivo.
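
The logic of this deletion mapping can be sketched as an interval calculation. The example below is illustrative only: it assumes that the full-length protein and the four truncations other than 1-145 and 218-438 all retain the region needed for the stronger interaction, which the text implies but does not state residue by residue. The helper names `shared_region` and `overlaps` are hypothetical.

```python
from typing import List, Optional, Tuple

Interval = Tuple[int, int]  # inclusive residue range (start, end)


def shared_region(constructs: List[Interval]) -> Optional[Interval]:
    """Residues present in every construct, or None if there are none."""
    start = max(s for s, _ in constructs)
    end = min(e for _, e in constructs)
    return (start, end) if start <= end else None


def overlaps(a: Interval, b: Interval) -> bool:
    """True if two inclusive residue ranges share at least one residue."""
    return a[0] <= b[1] and b[0] <= a[1]


# Residue ranges taken from the construct names in the text (BnSCL1 is 461 aa).
STRONGER = [(1, 461), (1, 358), (1, 261), (1, 217), (146, 358)]
WEAKER = [(1, 145), (218, 438)]  # constructs described as lacking the VHIID motif

candidate = shared_region(STRONGER)
print("region common to the stronger interactors:", candidate)
print("absent from every weaker construct:",
      all(not overlaps(w, candidate) for w in WEAKER))
```

Under the stated assumption, the region shared by the stronger interactors and missing from both weaker constructs is residues 146-217, which is consistent with the conclusion that the VHIID-containing region is required for the interaction with HDA19.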

BnSCL1 Activates Transcription of a Reporter Gene in Yeast



To further characterize the biological function of BnSCL1, its function as a
transcription activator was investigated. Transactivation experiments were performed
in yeast (Fig. 25), whereby a yeast strain carrying three reporter genes, lacZ, HIS3
and URA3, driven by promoters fused to GAL4 DNA binding sites and independently
integrated into the yeast genome, was transformed with the effector plasmid pDBLeu-BnSCL1
comprising BnSCL1 fused to the GAL4 DB under the control of the ADH
promoter. Transformation with the effector plasmid increased β-galactosidase
activity to a level similar to that of yeast strain D, which contains plasmid pairs
expressing fusion proteins with strong protein-protein interaction, and approximately
20-fold relative to either vector pDBLeu alone or yeast control strain A, which
contains plasmid pairs expressing fusion proteins without protein-protein interaction
(Fig. 25). Reporter genes HIS3 and URA3 were also strongly transactivated by
BnSCL1 protein (data not shown). These results indicate that BnSCL1 exhibits
significant transcription activator activity in yeast.

To map the transactivation domain of the BnSCL1 activator, a series of
deletion mutants of BnSCL1 protein were generated (Fig. 25a) and used in in vivo
transactivation assays in yeast. As shown in Figure 6b, any of the deletions from the
C-terminus of BnSCL1 or any truncation from the N-terminus resulted in a decrease of at
least 85% in β-galactosidase activity relative to the wild type BnSCL1 protein. This
demonstrates that the transactivation domain of bnKCP1 may reside in both the N-
and C-terminal regions.
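
The two comparisons used in these transactivation assays, fold activation over a control and percent decrease relative to wild type, reduce to the arithmetic sketched below. The activity readings are hypothetical placeholders chosen only to reproduce the "approximately 20-fold" and "at least 85%" figures; they are not data from the experiments, and the function names are illustrative.

```python
def fold_change(sample: float, control: float) -> float:
    """Reporter activity of a sample expressed as a multiple of the control."""
    if control <= 0:
        raise ValueError("control activity must be positive")
    return sample / control


def percent_decrease(mutant: float, wild_type: float) -> float:
    """Percent reduction in activity of a mutant relative to wild type."""
    if wild_type <= 0:
        raise ValueError("wild-type activity must be positive")
    return 100.0 * (wild_type - mutant) / wild_type


# Hypothetical beta-galactosidase readings in arbitrary units.
vector_alone = 1.0
wild_type_bnscl1 = 20.0
deletion_mutant = 3.0

print("fold activation:", fold_change(wild_type_bnscl1, vector_alone))
print("percent decrease:", percent_decrease(deletion_mutant, wild_type_bnscl1))
```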

BnSCL1 Gene is Expressed Mainly in Roots

The expression pattern of the BnSCL1 gene was analyzed by RNA gel blot
analysis and quantitative RT-PCR using total RNA extracted from various organs of
B. napus (Fig. 26). As shown in Figure 26a, there were two BnSCL1 transcripts of 1.6
kb and 2.8 kb in the RNA blot probed with the ORF of BnSCL1, suggesting the
existence of either two species of BnSCL1 cDNA produced by alternative splicing in
the B. napus genome or a BnSCL1 homologue cross-hybridizing to the probe. Both of
them accumulated at highest levels in roots, whereas their expression was weak in
flowers and stems, and undetectable in leaves and siliques. Results obtained using
quantitative RT-PCR analysis (Fig. 26b) were consistent with those obtained with
northern blotting. In addition, RT-PCR analysis revealed strong expression in seedling
shoots (Fig. 26b). This expression pattern is similar to that of the Arabidopsis SCR gene
(Laurenzio et al., 1996) and to those of most SCLs (Pysh et al., 1999). This suggests
that BnSCL1 and SCR may share similar functions in the regulation of root
development.

BnSCL1 Responds to Auxin Treatment

The plant hormone auxin plays an important role in cell division, cell
elongation, cell differentiation, lateral root initiation and gravitropism (Davies, 1995;
Berleth and Sachs, 2001; Liscum and Stowe-Evans, 2000). Recent studies have
demonstrated that auxin distribution organizes the pattern and polarity in the root
meristem (Sabatini et al., 1999). To determine whether the dominant role of
SCARECROW-like proteins (SCLs) in root biology is associated with auxin,
quantitative RT-PCR was used to examine the expression of the BnSCL1 gene in
four-leaf stage and 10 dpg seedlings treated with the synthetic auxin 2,4-D. As shown in
Figure 27, BnSCL1 mRNA accumulation increased by approximately 50% within 30
min of application of 1 mM 2,4-D, and then decreased rapidly to a lower level, when
compared to untreated plants (Fig. 27).

Auxin levels are known to modulate the degradation rate of Aux/IAA
(auxin/indole-3-acetic acid protein) family members through a proteolytic regulation
mechanism (Zenser et al., 2001). To examine whether auxin levels also influence the
expression pattern of the BnSCL1 gene, quantitative RT-PCR was used to analyse total
RNA isolated from shoots and roots of 10 dpg seedlings treated with variable
concentrations of 2,4-D ranging from 1 pM to 1 mM (Fig. 28). Expression of BnSCL1
in shoots was rapidly downregulated by auxin even at the lowest level (1 pM) of 2,4-D,
indicating that the BnSCL1 response to auxin is very sensitive (Fig. 28a). BnSCL1
expression in roots, however, was upregulated by auxin, although application of a
higher concentration (100 µM) of auxin was required to produce an effect (Fig. 28b).
To determine whether the response of the BnSCL1 gene to auxin was due to the exogenous
application rather than the intercellular auxin synthesis, seedlings were treated for 24
h with 50 µM of naphthylphthalamic acid (NPA), a polar auxin transport inhibitor,
and the expression of BnSCL1 in response to auxin was analysed using quantitative
One-Step RT-PCR. As can be seen in Figure 28c, the BnSCL1 mRNA accumulation
profiles were not changed in either shoots or roots after NPA treatment followed
by the application of auxin at different concentrations. These results suggest that the
response of BnSCL1 to the application of exogenous auxin was tissue-specific, or the
expression of BnSCL1 may be regulated by auxin distribution in plants.

Expression of SCR in apical meristems was found to be controlled by
chromatin assembly factor-1 (CAF-1) (Kaya et al., 2001), and auxin gene expression
mutations were found to be located within an Arabidopsis RPD3-like histone deacetylase
gene, HDA6, using a map-based cloning approach (Murfett et al., 2001). However, no
alterations in gene expression of endogenous auxin response genes were detected in
the mutants, and no effect on auxin-inducible GUS expression was found after
seedlings were treated with the HDAC inhibitor sodium butyrate at concentrations up to 1
mM for 24 h (Murfett et al., 2001). To determine whether the BnSCL1 response to auxin
is modulated by HDA19, 9 dpg seedlings were treated with 2,4-D at concentrations
ranging from 10^-6 to 10^3 µM, or treated with 50 mM sodium phosphate buffer as
control, after sodium butyrate treatment for 24 h at a concentration of 10 mM. Relative
expression was investigated using quantitative One-Step RT-PCR to analyze RNA
extracted from shoots and roots of seedlings. As shown in Figure 28, although the
expression pattern of BnSCL1 in response to auxin in shoots was different from that
in roots, the inhibition of histone deacetylase led to expression profiles of BnSCL1
in shoots that were similar to those in roots, i.e. the expression was upregulated by auxin
at a concentration of 1 pM and downregulated by auxin at higher concentrations. The
fact that HDAC inhibition led to the alteration of BnSCL1 expression in response to
auxin suggests that the response of BnSCL1 to auxin is modulated by histone
deacetylase.
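
The 2,4-D range is quoted in two notations, 1 pM to 1 mM and 10^-6 to 10^3 µM, and the sketch below checks that these describe the same range. It works in integer powers of ten to avoid rounding. The assumption that intermediate doses were spaced in ten-fold steps is made only for illustration and is not stated in the text; the helper names are hypothetical.

```python
# log10 of each unit expressed in mol/L
UNIT_EXPONENT = {"M": 0, "mM": -3, "uM": -6, "nM": -9, "pM": -12}


def log10_molar(exponent: int, unit: str) -> int:
    """log10 of the molar concentration of 10**exponent expressed in `unit`."""
    return exponent + UNIT_EXPONENT[unit]


def format_conc(log_m: int) -> str:
    """Render a power-of-ten molar concentration with a convenient unit prefix."""
    for unit in ("M", "mM", "uM", "nM", "pM"):
        shift = log_m - UNIT_EXPONENT[unit]
        if 0 <= shift < 3:
            return f"{10 ** shift} {unit}"
    raise ValueError("concentration outside the supported range")


low = log10_molar(-6, "uM")   # 10^-6 uM
high = log10_molar(3, "uM")   # 10^3 uM

# Assumed ten-fold dilution series spanning the stated range.
dose_series = [format_conc(e) for e in range(low, high + 1)]

print(format_conc(low), "to", format_conc(high))
print(dose_series)
```

On this reading, the 100 µM dose at which root expression responded sits one ten-fold step below the top of the range.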

These results suggest a molecular mechanism by which BnSCL1 functions as a
transcription factor to regulate gene expression by recruiting HDAC to the promoter
regions of target genes.

Example 6: Modulation of activity of a gene of interest using a recruitment
factor

Two constructs are prepared: 1) an activator+reporter construct (Fig. 29B)
carrying the lacZ reporter gene downstream from a Tet operator sequence (Tet-7X),
and the BnSCL1 and VP16 genes encoding a VP16-SCL fusion protein that is able to
bind the Tet operator sequence; and 2) an effector construct carrying the HDA19 gene
(Fig. 29B).

The activator+reporter construct is introduced and expressed in yeast cells, for
example MaV203 cells as described in Example 4, to produce a reporter yeast.
Activity of the lacZ product is quantified by measuring the β-galactosidase activity using
chlorophenol red-β-D-galactopyranoside (CPRG) (Gibco BRL). In the reporter
yeast, expression of the activator+reporter construct results in the expression of the
VP16-SCL fusion protein that binds to the Tet operator sequence, thereby activating
expression of the lacZ reporter gene due to VP16.

The reporter yeast expressing the activator+reporter construct is treated with
tetracycline. Expression of the lacZ reporter gene is quantified by measuring the
β-galactosidase activity using chlorophenol red-β-D-galactopyranoside (CPRG). The
expression of the activator+reporter construct in the presence of tetracycline in yeast
cells produces a baseline level of LacZ activity.

The effector construct is then introduced into the reporter yeast so that the
activator+reporter and the effector constructs are both expressed, and the activity of
the LacZ product is determined as indicated above. Results demonstrate that LacZ
activity is reduced in the yeast expressing both the activator+reporter and the effector
constructs, when compared to LacZ activity determined in the reporter yeast
expressing only the activator+reporter construct, and approximates the level of
LacZ activity produced by the reporter yeast when treated with
tetracycline.

This result indicates that the expression of a gene of interest (in this case lacZ)
may be reduced by targeting a recruitment factor, for example SCL1, to the nucleotide
sequence encoding the gene of interest, and permitting the recruitment factor to bind
an HDAC.
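
The outcomes described for the two-construct assay can be summarized as a small qualitative lookup. This is only a restatement of the expected results in code form, not a quantitative model; the function name, the argument names and the three level labels are illustrative assumptions.

```python
def expected_lacz_level(activator: bool, effector: bool, tetracycline: bool = False) -> str:
    """Qualitative LacZ activity expected for a given combination of components."""
    if not activator or tetracycline:
        # Without the VP16-SCL fusion, or with tetracycline added, only baseline activity.
        return "baseline"
    if effector:
        # HDA19 bound via the SCL portion counteracts VP16-driven activation.
        return "reduced, near baseline"
    return "activated"


conditions = {
    "reporter yeast": dict(activator=True, effector=False),
    "reporter yeast + tetracycline": dict(activator=True, effector=False, tetracycline=True),
    "reporter yeast + effector": dict(activator=True, effector=True),
}

for label, kwargs in conditions.items():
    print(f"{label}: {expected_lacz_level(**kwargs)}")
```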







A similar set of assays is carried out comprising three constructs: 1) a
reporter construct carrying the lacZ reporter gene, 2) an activator construct carrying
the BnSCL1 and VP16 genes, and 3) an effector construct carrying the HDA19 gene
(see Fig. 29A). The constructs are expressed in yeast cells, for example MaV203
cells as described above in Example 4, in the following combinations:
reporter construct alone,
reporter and activator constructs,
reporter, activator and effector constructs.
The expression of the lacZ reporter gene is quantified by measuring the β-galactosidase
activity using chlorophenol red-β-D-galactopyranoside (CPRG) (Gibco BRL).

The expression of the reporter construct alone in yeast cells produces a
baseline level of β-galactosidase activity. Expression of both the reporter and
activator constructs yields an elevated level of β-galactosidase activity, when
compared with the activity observed in the presence of the reporter construct alone,
while the reporter, activator and effector constructs together result in approximately
background levels of β-galactosidase activity.

The present invention has been described with regard to preferred
embodiments. However, it will be obvious to persons skilled in the art that a number
of variations and modifications can be made without departing from the scope of the
invention as described herein.

THE EMBODIMENTS OF THE INVENTION IN WHICH AN EXCLUSIVE
PROPERTY OR PRIVILEGE IS CLAIMED ARE DEFINED AS FOLLOWS:

1. A method to regulate expression of a nucleic acid sequence of interest
comprising:

i) providing a eukaryote having:

1) a first nucleotide sequence comprising,

a) said nucleic acid sequence of interest operatively linked to a first
regulatory region,

b) an operator sequence capable of binding a fusion protein, and;

2) a second nucleotide sequence comprising a second regulatory region in
operative association with a nucleotide sequence encoding said fusion protein,
said fusion protein comprising,

a) a DNA binding protein, or a portion thereof, capable of binding said
operator sequence, and;

b) a recruitment factor protein, or a portion thereof, capable of binding
a chromatin remodelling protein; and

ii) growing said eukaryote, wherein expression of said second nucleotide sequence
produces said fusion protein that regulates expression of said nucleic acid sequence of
interest.

2. The method of claim 1, wherein the eukaryote is a plant.

3. The method of claim 1, wherein in said step of introducing (step i)), said
operator sequence is selected from the group consisting of a ROS operator, a Tet
operator, Sin3, VP16, GAL4, Lex A, UMe6, ERF, SEBF, CBF and a DNA binding
domain of a transcription factor.

The method of claim 1, wherein the recruitment factor is characterized as
having a histone deacetylase binding domain or a histone acetylase binding domain.

4. The method of claim 1, wherein in said step of introducing (step ii)), said
recruitment factor protein is selected from the group consisting of histone acetylase
recruitment factor, histone deacetylase recruitment factor, KID, ADA, SAGA,
STAGA, PCAF, TFIID, TFIIIC, bnKCP1 and BnSCL1.



5. A method of enhancing expression of a nucleic acid sequence of interest
comprising:

i) providing a plant with one or more constructs comprising:

1) a first nucleotide sequence comprising,

a) said nucleic acid sequence of interest operatively linked to a first
regulatory region, and;

b) an operator sequence capable of binding a fusion protein;

2) a second nucleotide sequence comprising a second regulatory region in
operative association with a nucleotide sequence encoding said fusion protein
comprising,

a) a DNA binding protein, or a portion thereof, capable of binding said
operator sequence, and;

b) a recruitment factor, or portion thereof, that binds a histone
acetyltransferase (HAT) protein;

ii) growing said plant, and

iii) expressing said second nucleotide sequence such that said fusion protein is
produced and expression of said nucleic acid sequence of interest is increased.

6. The method of claim 5, wherein the second regulatory region comprises an
inducible promoter.

7. The method of claim 5, wherein the HAT is Gcn5.

8. The method of claim 5, wherein in said step of introducing (step i)), said
operator sequence is selected from the group consisting of a ROS operator, a Tet
operator, Sin3, VP16, GAL4, Lex A, UMe6, ERF, SEBF, CBF and a DNA binding
domain of a transcription factor.

9. A method for selectively controlling the transcription of a nucleic acid
sequence of interest, comprising:
i) providing a first plant comprising a first genetic construct, said first genetic
construct comprising a first regulatory region operatively linked to a nucleic
acid sequence of interest and at least one ROS operator sequence capable of
controlling the activity of said first regulatory region;
ii) providing a second plant comprising a second genetic construct, said second
genetic construct comprising a second regulatory region in operative
association with a nucleic acid molecule encoding a fusion protein comprising
a ROS repressor, or a fragment thereof, and a recruitment factor characterized
as having a histone deacetylase binding domain, or a fragment thereof;
iii) crossing said first plant and said second plant to obtain progeny, said progeny
comprising both said first genetic construct and said second genetic construct,
and characterized in that the expression of said second genetic construct
represses expression of said first genetic construct.

10. The method of claim 6, wherein said first and second regulatory regions are
either the same or different and are selected from the group consisting of a
constitutive promoter, an inducible promoter, a tissue specific promoter, and a
developmental promoter.

11. The method of claim 1, wherein, in said step of introducing (step i)), said first,
second, or both said first and second nucleotide sequences are incorporated into said
plant by crossing.

12. The method of claim 8, wherein said crossing comprises crossing a first plant
comprising said first nucleotide sequence with a second plant comprising said second
nucleotide sequence, to obtain progeny.

13. The method of claim 1, wherein, in said step of introducing (step i)), said first,
second, or both said first and second nucleotide sequences are incorporated into said
plant by transformation.



14. A method to regulate expression of an endogenous nucleic acid sequence of
interest comprising:
i) providing a eukaryote having a nucleotide sequence comprising, a regulatory
region, operatively linked with a nucleotide sequence encoding a fusion protein, said
fusion protein comprising,

a) a DNA binding protein, or a portion thereof, capable of binding a segment
of a DNA sequence of said endogenous nucleotide sequence of interest; and

b) a recruitment factor protein, or a portion thereof; and

ii) growing said eukaryote, wherein expression of said nucleotide sequence produces
said fusion protein that regulates expression of said endogenous nucleic acid sequence
of interest.

15. The method of claim 11, wherein in said step of introducing (step i)), said
recruitment factor protein is selected from the group consisting of histone acetylase
recruitment factor, and histone deacetylase recruitment factor.

16. An isolated nucleic acid sequence encoding the sequence of bnKCP1 (SEQ ID
NO:71).

17. An isolated nucleic acid sequence encoding amino acids 1 to 80 of SEQ ID
NO:71.

18. An isolated nucleic acid sequence encoding amino acids 1 to 160 of SEQ ID
NO:71.

19. An isolated nucleic acid sequence encoding amino acids 81 to 215 of SEQ ID
NO:71.

20. The method of claim 1, wherein the recruitment factor protein is bnKCP1
(SEQ ID NO:71) or a fragment thereof.

21. The method of claim 11, wherein the recruitment factor protein is bnKCP1
(SEQ ID NO:71) or a fragment thereof.

22. An isolated nucleic acid encoding a bnKCP1 fusion protein, GAL4DB-bnKCP1.

23. An isolated nucleic acid encoding a HDAC fusion protein, GAL4DB-HDAC.

24. An isolated nucleic acid sequence encoding the sequence of BnSCL1 (SEQ ID
NO:81).

25. An isolated nucleic acid sequence encoding amino acids 1 to 358 of SEQ ID
NO:81.

26. An isolated nucleic acid sequence encoding amino acids 1 to 261 of SEQ ID
NO:81.

27. An isolated nucleic acid sequence encoding amino acids 1 to 217 of SEQ ID
NO:81.

28. An isolated nucleic acid sequence encoding amino acids 146 to 358 of SEQ ID
NO:81.

29. The method of claim 1, wherein the recruitment factor protein is BnSCL1
(SEQ ID NO:81) or a fragment thereof.

30. The method of claim 11, wherein the recruitment factor protein is BnSCL1
(SEQ ID NO:81) or a fragment thereof.

31. A method to regulate expression of a nucleic acid sequence of interest in a
plant comprising:
i) introducing into said plant:

1) a first nucleotide sequence comprising,

a) said nucleic acid sequence of interest operatively linked to a first
regulatory region,

b) an operator sequence capable of binding a bnKCP-fusion protein,
and;

2) a second nucleotide sequence comprising a second regulatory region in
operative association with a nucleotide sequence encoding said bnKCP-fusion
protein, said bnKCP-fusion protein comprising,

a) a DNA binding protein, or a portion thereof, capable of binding said
operator sequence, and;

b) a bnKCP1, or a portion thereof; and

ii) growing said plant, wherein expression of said second nucleotide sequence
produces said fusion protein that regulates expression of said nucleic acid sequence of
interest.



32. A method to regulate expression of a nucleic acid sequence of interest in a
plant comprising:
i) introducing into said plant:

1) a first nucleotide sequence comprising,

a) said nucleic acid sequence of interest operatively linked to a first
regulatory region,

b) an operator sequence capable of binding a BnSCL-fusion protein,
and;

2) a second nucleotide sequence comprising a second regulatory region in
operative association with a nucleotide sequence encoding said BnSCL-fusion
protein, said BnSCL-fusion protein comprising,

a) a DNA binding protein, or a portion thereof, capable of binding said
operator sequence, and;

b) a BnSCL1, or a portion thereof; and

ii) growing said plant, wherein expression of said second nucleotide sequence
produces said fusion protein that regulates expression of said nucleic acid sequence of
interest.

33. A method of increasing cold tolerance in a plant, comprising:
i) providing a plant having a nucleotide sequence of interest operatively
linked to a first regulatory region, the nucleotide sequence of interest
encoding bnKCP1, or fragments thereof; and
ii) maintaining the plant under conditions where bnKCP1 is expressed
thereby increasing cold tolerance in the plant.

34. A method of controlling expression of a nucleic acid sequence of interest,
comprising:

i) providing a eukaryote having: 

1) a first nucleotide sequence comprising 

a) said nucleic acid sequence of interest operatively linked to a first 
regulatory region, 

b) an operator sequence capable of binding a fusion protein, and 

c) a second regulatory region in operative association with a nucleotide 
sequence encoding said fusion protein, the fusion protein including a 
DNA binding protein, or a portion thereof, capable of binding said 
operator sequence and a recruitment factor protein, or a portion 
thereof, capable of binding a chromatin remodelling protein; and 

2) a second nucleotide sequence comprising a third regulatory region in 

operative association with a nucleotide sequence encoding a chromatin 
remodelling protein; and 

ii) growing said eukaryote, wherein expression of said first nucleotide sequence 
produces said fusion protein that increases expression of said nucleic acid sequence of 
interest and wherein expression of said second nucleotide sequence produces said 
chromatin remodelling protein to repress expression of said nucleic acid sequence of 
interest. 

35. The method of claim 34, wherein the chromatin remodelling protein is 
HDA19. 

36. The method of claim 35, wherein the recruitment factor protein is BnSCL1 or
bnKCP1.

37. The method of claim 35, wherein the DNA binding protein is VP16 or GAL4. 

38. A method of controlling expression of a nucleic acid sequence of interest,
comprising:

i) providing a eukaryote having:

1) a first nucleotide sequence comprising,
a) said nucleic acid sequence of interest operatively linked to a first
regulatory region, and

b) an operator sequence capable of binding a fusion protein, and
2) a second nucleotide sequence comprising a regulatory region in operative
association with a nucleotide sequence encoding said fusion protein,
the fusion protein including a DNA binding protein, or a portion
thereof, capable of binding said operator sequence and a recruitment
factor protein, or a portion thereof, capable of binding a chromatin
remodelling protein; and
3) a third nucleotide sequence comprising a third regulatory region in
operative association with a nucleotide sequence encoding a chromatin
remodelling protein; and
ii) growing said eukaryote, wherein expression of said second nucleotide sequence
produces said fusion protein that increases expression of said nucleic acid sequence of
interest and wherein expression of said third nucleotide sequence produces said
chromatin remodelling protein to repress expression of said nucleic acid sequence of
interest.

39. The method of claim 38, wherein the chromatin remodelling protein is
HDA19.

40. The method of claim 39, wherein the recruitment factor protein is BnSCL1 or
bnKCP1.

41. The method of claim 39, wherein the DNA binding protein is VP16 or GAL4.
GCGGATCCCC GGGTATGACT GAGACTGCTT ACGGTAACGC
TCAGGATCTT CTTGTTGAGC TTACTGCTGA TATCGTTGCT
GCTTACGTTT CTAACCACGT TGTTCCTGTT ACTGAGCTTC
CTGGACTTAT CTCTGATGTT CATACTGCAC TTTCTGGAAC
ATCTGCTCCT GCTTCTGTTG CTGTTAACGT TGAGAAGCAG
AAGCCTGCTG TTTCTGTTCG TAAGTCTGTT CAGGATGATC
ATATCGTTTG TTTGGAGTGT GGTGGTTCTT TCAAGTCTCT
CAAGCGTCAC CTTACTACTC ATCACTCTAT GACTCCAGAG
GAGTATAGAG AGAAGTGGGA TCTTCCTGTT GATTACCCTA
TGGTTGCTCC TGCTTACGCT GAGGCTCGTT CTCGTCTCGC
TAAGGAGATG GGTCTCGGTC AGCGTCGTAA GGCTAACCGT
CCAAAAAAGA AGCGTAAGGT CTGAGAGCTC GC

FIG. 1B
Comparison of ROS DNA Binding Site (Operator) Sequences

SEQUENCE LISTING



<110> Her Majesty the Queen in Right of Canada as Represented by the 
Minister of Agriculture and Agri-Food 



<120> Regulation Of Gene Expression Using Chromatin Remodelling Factors 



<130> 08-890985WO 

<150> US 60/387,088 
<151> 2002-06-06 

<160> 104 

<170> PatentIn version 3.0

<210> 1 

<211> 142 

<212> PRT 

<213> WT-ROS 

<400> 1 

Met Thr Glu Thr Ala Tyr Gly Asn Ala Gln Asp Leu Leu Val Glu Leu
1               5                   10                  15

Thr Ala Asp Ile Val Ala Ala Tyr Val Ser Asn His Val Val Pro Val
            20                  25                  30

Thr Glu Leu Pro Gly Leu Ile Ser Asp Val His Thr Ala Leu Ser Gly
        35                  40                  45

Thr Ser Ala Pro Ala Ser Val Ala Val Asn Val Glu Lys Gln Lys Pro
    50                  55                  60

Ala Val Ser Val Arg Lys Ser Val Gln Asp Asp His Ile Val Cys Leu
65                  70                  75                  80

Glu Cys Gly Gly Ser Phe Lys Ser Leu Lys Arg His Leu Thr Thr His
                85                  90                  95

His Ser Met Thr Pro Glu Glu Tyr Arg Glu Lys Trp Asp Leu Pro Val
            100                 105                 110

Asp Tyr Pro Met Val Ala Pro Ala Tyr Ala Glu Ala Arg Ser Arg Leu
        115                 120                 125

Ala Lys Glu Met Gly Leu Gly Gln Arg Arg Lys Ala Asn Arg
    130                 135                 140

<210> 2 

<211> 472 

<212> DNA 

<213> synthetic ROS 



gcggatcccc gggtatgact gagactgctt acggtaacgc tcaggatctt cttgttgagc 60 

ttactgctga tatcgttgct gcttacgttt ctaaccacgt tgttcctgtt actgagcttc 120 

ctggacttat ctctgatgtt catactgcac tttctggaac atctgctcct gcttctgttg 180 

ctgttaacgt tgagaagcag aagcctgctg tttctgttcg taagtctgtt caggatgatc 240 

atatcgtttg tttggagtgt ggtggttctt tcaagtctct caagcgtcac cttactactc 300 

atcactctat gactccagag gagtatagag agaagtggga tcttcctgtt gattacccta 360 

tggttgctcc tgcttacgct gaggctcgtt ctcgtctcgc taaggagatg ggtctcggtc 420 

agcgtcgtaa ggctaaccgt ccaaaaaaga agcgtaaggt ctgagagctc gc 472 
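
As a consistency check on the listing, SEQ ID NO:2 can be translated and compared with the protein entries. The sketch below is illustrative and makes two assumptions: translation starts at the first ATG in the sequence, and the standard genetic code applies. The function name `translate_from_first_atg` is hypothetical.

```python
from itertools import product

# SEQ ID NO:2 as listed above (472 nt); spaces and line breaks are stripped.
SEQ_ID_2 = """
gcggatcccc gggtatgact gagactgctt acggtaacgc tcaggatctt cttgttgagc
ttactgctga tatcgttgct gcttacgttt ctaaccacgt tgttcctgtt actgagcttc
ctggacttat ctctgatgtt catactgcac tttctggaac atctgctcct gcttctgttg
ctgttaacgt tgagaagcag aagcctgctg tttctgttcg taagtctgtt caggatgatc
atatcgtttg tttggagtgt ggtggttctt tcaagtctct caagcgtcac cttactactc
atcactctat gactccagag gagtatagag agaagtggga tcttcctgtt gattacccta
tggttgctcc tgcttacgct gaggctcgtt ctcgtctcgc taaggagatg ggtctcggtc
agcgtcgtaa ggctaaccgt ccaaaaaaga agcgtaaggt ctgagagctc gc
""".replace(" ", "").replace("\n", "")

# Standard genetic code, with codons enumerated in t, c, a, g order.
BASES = "tcag"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def translate_from_first_atg(dna: str) -> str:
    """Translate from the first ATG up to (not including) the first stop codon."""
    start = dna.find("atg")
    if start < 0:
        raise ValueError("no start codon found")
    protein = []
    for i in range(start, len(dna) - 2, 3):
        residue = CODON_TABLE[dna[i:i + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


protein = translate_from_first_atg(SEQ_ID_2)
print(len(SEQ_ID_2), "nt ->", len(protein), "residues")
print(protein)
```

Under these assumptions the translation is 149 residues long, matching the length given for SEQ ID NO:4; its first 142 residues agree with SEQ ID NO:1, followed by the seven-residue extension Pro-Lys-Lys-Lys-Arg-Lys-Val.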

<210> 3
<211> 447
<212> DNA
<213> consensus

<220>
<221> misc_feature
<222> (1)..(447)
<223> where n is "a" or "c" or "t" or "g"

<220>
<221> misc_feature
<222> (1)..(447)

<220>
<221> misc_feature
<222> (1)..(447)
<223> where v is "a" or "c" or "g"

<220>
<221> misc_feature
<222> (1)..(447)
<223> where k is "t" or "g"

<220>
<221> misc_feature
<222> (1)..(447)
<223> where s is "c" or "g"

<220>
<221> misc_feature
<222> (1)..(447)
<223> where w is "a" or "t"

<220>
<221> misc_feature
<222> (1)..(447)
<223> where h is "a" or "t" or "c"

<220>
<221> misc_feature
<222> (1)..(447)
<223> where b is "t" or "c" or "g"

<220>
<221> misc_feature
<222> (1)..(447)
<223> where r is "a" or "g"

<220>
<221> misc_feature
<222> (1)..(447)
<223> where y is "c" or "t"

<220>
<221> misc_feature
<222> (1)..(447)
<223> where m is "a" or "c"



<400> 3 

atgacngara cngcntaygg naaygcncar gayytnytng tngarytnac ngcngayath 60 
gtngcngcnt aygtnwsnaa ycaygtngtn ccngtnacng arytnccngg nytnathwsn 120 
gaygtncaya cngcnytnws nggnacnwsn gcnccngcnw sngtngcngt naaygtngar 180 
aarcaraarc cngcngtnws ngtnmgnaar wsngtncarg aygaycayat hgtntgyytn 240 
gartgyggng gnwsnttyaa rwsnytnaar mgncayytna cnacncayca ywsnatgacn 300 
ccngargart aymgngaraa rtgggayytn ccngtngayt ayccnatggt ngcnccngcn 360 
taygcngarg cnmgnwsnmg nytngcnaar garatgggay tnggncarmg nmgnaargcn 420 
aaymgnccna araaraarmg naargtn 447 

<210> 4 

<211> 149 

<212> PRT 

<213> synthetic ROS 

<400> 4 

Met Thr Glu Thr Ala Tyr Gly Asn Ala Gln Asp Leu Leu Val Glu Leu 
1 5 10 15 

Thr Ala Asp Ile Val Ala Ala Tyr Val Ser Asn His Val Val Pro Val 
20 25 30 

Thr Glu Leu Pro Gly Leu Ile Ser Asp Val His Thr Ala Leu Ser Gly 
35 40 45 

Thr Ser Ala Pro Ala Ser Val Ala Val Asn Val Glu Lys Gln Lys Pro 
50 55 60 

Ala Val Ser Val Arg Lys Ser Val Gln Asp Asp His Ile Val Cys Leu 
65 70 75 80 

Glu Cys Gly Gly Ser Phe Lys Ser Leu Lys Arg His Leu Thr Thr His 
85 90 95 

His Ser Met Thr Pro Glu Glu Tyr Arg Glu Lys Trp Asp Leu Pro Val 
100 105 110 

Asp Tyr Pro Met Val Ala Pro Ala Tyr Ala Glu Ala Arg Ser Arg Leu 
115 120 125 

Ala Lys Glu Met Gly Leu Gly Gln Arg Arg Lys Ala Asn Arg Pro Lys 
130 135 140 

Lys Lys Arg Lys Val 
145 

<210> 5 

<211> 10 

<212> PRT 

<213> ROS binding 



<400> 5 

Trp Ala Thr Asp His Trp Lys Met Ala Arg 
1 5 10 

<210> 6 

<211> 7 

<212> PRT 

<213> NLS 

<400> 6 

Pro Lys Lys Lys Arg Lys Val 
1 5 

<210> 7 
<211> 25 
<212> DNA 

<213> ROS operator 

<400> 7 
tatatttcaa ttttattgta atata 25 

<210> 8 
<211> 27 
<212> DNA 

<213> IPT gene operator 

<400> 8 
tataattaaa atattaactg tcgcatt 27 

<210> 9 
<211> 11 
<212> DNA 

<213> operator sequence binding to ERF 

<400> 9 
taagagccgc c 11 

<210> 10 
<211> 9 
<212> DNA 

<213> operator sequence binding to SEBF 

<400> 10 
gactgtcac 9 

<210> 11 

<211> 9 

<212> DNA 

<213> operator sequence binding to CBF 






WO 03/104462 PCT/CA03/00822 



<400> 11 
taccgacat 

<210> 12 

<211> 8 

<212> DNA 

<213> operator sequence binding to CBF 

<400> 12 
tggccgac 

<210> 13 
<211> 16 
<212> PRT 

<213> NLS of AGAMOUS protein 
<400> 13 

Arg Ile Glu Asn Thr Thr Asn Arg Gln Val Thr Phe Cys Lys Arg Arg 
1 5 10 15 

<210> 14 
<211> 18 
<212> PRT 

<213> NLS of TGA-1A protein 
<400> 14 

Arg Arg Leu Ala Gln Asn Arg Glu Ala Ala Arg Lys Ser Arg Leu Arg 

Lys Lys 

<210> 15 

<211> 21 

<212> PRT 

<213> NLS of TGA-1B protein 

<400> 15 

Lys Lys Arg Ala Arg Leu Val Arg Asn Arg Glu Ser Ala Gln Leu Ser 
1 5 10 15 

Arg Gln Arg Lys Lys 
20 

<210> 16 
<211> 18 
<212> PRT 

<213> NLS of O2 NLS B protein 



<400> 16 

Arg Lys Arg Lys Glu Ser Asn Arg Glu Ser Ala Arg Arg Ser Arg Tyr 
1 5 10 15 

Arg Lys 

<210> 17 

<211> 45 

<212> PRT 

<213> NLS of NIa protein 



<220> 

<221> LIPID 

<222> (1) . . (45) 

<223> where "x" is any amino acid 

<400> 17 

Lys Lys Asn Gln Lys His Lys Leu Lys Met Xaa Xaa Xaa Xaa Xaa Xaa 
1 5 10 15 

Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa 
20 25 30 

Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Lys Arg Lys 
35 40 45 

<210> 18 
<211> 16 
<212> PRT 

<213> NLS nucleoplasmin protein 

<400> 18 

Lys Arg Pro Ala Ala Thr Lys Lys Ala Gly Gln Ala Lys Lys Lys Lys 
1 5 10 15 

<210> 19 

<211> 17 

<212> PRT 

<213> NLS of NO38 protein 

<400> 19 

Lys Arg Ile Ala Pro Asp Ser Ala Ser Lys Val Pro Arg Lys Lys Thr 
1 5 10 15 

Arg 



<210> 20 

<211> 17 

<212> PRT 

<213> NLS of N1/N2 protein 

<400> 20 

Lys Arg Lys Thr Glu Glu Glu Ser Pro Leu Lys Asp Lys Asp Ala Lys 

1 5 10 15 

Lys 

<210> 21 

<211> 17 

<212> PRT 

<213> NLS of Glucocorticoid receptor 



<400> 21 

Arg Lys Cys Leu Gln Ala Gly Met Asn Leu Glu Ala Arg Lys Thr Lys 
1 5 10 15 

Lys 

<210> 22 
<211> 17 
<212> PRT 

<213> NLS of Glucocorticoid a receptor 
<400> 22 

Arg Lys Cys Leu Gln Ala Gly Met Asn Leu Glu Ala Arg Lys Thr Lys 
1 5 10 15 

Lys 

<210> 23 

<211> 17 

<212> PRT 

<213> NLS of Glucocorticoid b receptor 

<400> 23 

Arg Lys Cys Leu Gln Ala Gly Met Asn Leu Glu Ala Arg Lys Thr Lys 
1 5 10 15 

Lys 

<210> 24 
<211> 17 
<212> PRT 

<213> NLS of Progesterone receptor 
<400> 24 

Arg Lys Cys Cys Gln Ala Gly Met Val Leu Gly Gly Arg Lys Phe Lys 
1 5 10 15 

Lys 

<210> 25 

<211> 17 

<212> PRT 

<213> NLS of Androgen receptor 



<400> 25 

Arg Lys Cys Tyr Glu Ala Gly Met Thr Leu Gly Ala Arg Lys Leu Lys 
1 5 10 15 

Lys 



<210> 26 

<211> 17 

<212> PRT 

<213> NLS of p53 protein 

<400> 26 

Arg Arg Cys Phe Glu Val Arg Val Cys Ala Cys Pro Gly Arg Asp Arg 
1 5 10 15 

<210> 27 

<211> 25 

<212> DNA 

<213> VirC/VirD operator sequence 



<400> 27 

tatatttcaa ttttattgta atata 

<210> 28 

<211> 108 

<212> DNA 

<213> ROS-OPDS 



<400> 28 
atctccactg acgtaaggga tgacgcacaa tcccactatc cttcgcaaga cccttcctct 60 

atataatata tttcaatttt attgtaatat aacacggggg actctaga 108 

<210> 29 
<211> 113 
<212> DNA 



<213> ROS-OPDA 

<400> 29 
gatcctctag agtcccccgt gttatattac aataaaattg aaatatatta tatagaggaa 60 
gggtcttgcg aaggatagtg ggattgtgcg tcatccctta cgtcagtgga gat 113 

<210> 30 
<211> 107 
<212> DNA 
<213> ROS-OPUS 

<400> 30 
atctccactg acgtaaggga tgacgcacaa tctatatttc aattttattg taatatacta 60 
tataaggaag ttcatttcat ttggagagaa cacgggggac tctagag 107 

<210> 31 
<211> 111 
<212> DNA 
<213> ROS-OPUA 

<400> 31 
gatcctctag agtcccccgt gttctctcca aatgaaatga acttccttat atagtatatt 60 
acaataaaat tgaaatatag attgtgcgtc atcccttacg tcagtggaga t 111 

<210> 32 
<211> 108 
<212> DNA 
<213> ROS-OPPS 

<400> 32 
atctccactg acgtaaggga tgacgcacaa tctatatttc aattttattg taatatacta 60 
tataatatat ttcaatttta ttgtaatata acacggggga ctctagag 108 

<210> 33 
<211> 112 
<212> DNA 
<213> ROS-OPPA 

<400> 33 
gatcctctag agtcccccgt gttatattac aataaaattg aaatatatta tatagtatat 60 
tacaataaaa ttgaaatata gattgtgcgt catcccttac gtcagtggag at 112 

<210> 34 
<211> 59 
<212> DNA 
<213> ROS-OP1 

<400> 34 
gatcctatat ttcaatttta ttgtaatata gctatatttc aattttattg taatataat 59 

<210> 35 
<211> 57 
<212> DNA 
<213> ROS-OP2 

<400> 35 
cgattatatt acaataaaat tgaaatatag ctatattaca ataaaattga aatatag 57 

<210> 36 

<211> 36 

<212> DNA 

<213> tms2 promoter sense primer 



<400> 36 
tgcggatgca taagcttgct gacattgcta gaaaag 36 

<210> 37 

<211> 26 

<212> DNA 

<213> tms2 promoter anti-sense primer 
<400> 37 

cggggatcct ttcagggcca tttcag 26 

<210> 38 
<211> 24 
<212> DNA 

<213> actin2 promoter sense primer 
<400> 38 

aagcttatgt atgcaagagt cagc 

<210> 39 
<211> 24 
<212> DNA 

<213> actin2 promoter anti-sense primer 
<400> 39 

ttgactagta tcagcctcag ccat 

<210> 40 

<211> 138 

<212> DNA 

<213> EcoRV to ATG of GUS 



ggtggtcagt cccttatg 138 

<210> 41 

<211> 136 

<212> DNA 

<213> EcoRV to ATG of GUS 

<400> 41 

gatatctcca ctgacgtaag ggatgacgca caatctatat ttcaatttta ttgtaatata 60 
ctatataagg aagttcattt catttggaga gaacacgggg gactctagag gatccccggg 120 
tggtcagtcc cttatg 136 

<210> 42 

<211> 137 

<212> DNA 

<213> EcoRV to ATG of GUS 

<400> 42 

gatatctcca ctgacgtaag ggatgacgca caatctatat ttcaatttta ttgtaatata 60 
ctatataata tatttcaatt ttattgtaat ataacacggg ggactctaga ggatccccgg 120 
gtggtcagtc ccttatg 137 

<210> 43 
<211> 237 
<212> DNA 

<213> EcoRV to ATG of GUS 

<400> 43 

gatatctcca ctgacgtaag ggatgacgca caatcccact atccttcgca agacccttcc 60 
tctatataat atatttcaat tttattgtaa tataacacgg gggactctag aggatcctat 120 
atttcaattt tattgtaata tagctatatt tcaattttat tgtaatataa tcgatttcga 180 
acccggggta ccgaattcct cgagtctaga ggatccccgg gtggtcagtc ccttatg 237 

<210> 44 

<211> 31 

<212> DNA 

<213> forward primer for HDA19 A. thaliana, pDBLeu-HDA19 

<400> 44 

gcgtcgacga tggatactgg cggcaattcg c 31 

<210> 45 
<211> 32 
<212> DNA 

<213> reverse primer for HDA19 A. thaliana, pDBLeu-HDA19 

<400> 45 
aggcggccgc ttatgtttta ggaggaaacg cc 32 

<210> 46 
<211> 31 
<212> DNA 

<213> forward primer for Gcn5 Arabidopsis, GST-Gcn5 

<400> 46 
gcgtcgacga tggactctca ctcttcccac c 31 

<210> 47 
<211> 31 
<212> DNA 

<213> reverse primer for Gcn5 Arabidopsis, GST-Gcn5 

<400> 47 
gcgcggccgc ctattgagat ttagcaccag a 31 

<210> 48 

<211> 31 

<212> DNA 

<213> reverse primer for HDA19, GST-HDA19 

<400> 48 

gcgcggccgc ttatgtttta ggaggaaacg c 31 

<210> 49 
<211> 29 

<212> DNA 

<213> forward primer for bnKCP1, 1-80, 1-160 (generation of mutants) 
<400> 49 

gcaagcttat ggcaggagga ggaccaact 29 



<210> 50 

<211> 29 

<212> DNA 

<213> reverse primer for bnKCP1 1-160 (generation of mutants) 

<400> 50 

cgctcgagct cctcctcatc attgtcttc 29 

<210> 51 

<211> 29 

<212> DNA 

<213> reverse primer for bnKCP1 1-80 (generation of mutants) 

<400> 51 

cgctcgagat gaacaggcaa aagaggcat 29 

<210> 52 
<211> 29 
<212> DNA 

<213> reverse primer for bnKCP1 (generation of mutants) 
<400> 52 

cgctcgagct catcttcttc ttcttcttc 29 

<210> 53 
<211> 30 
<212> DNA 

<213> forward primer for bnKCP1, 1-80 and 1-160 (in vivo assay and 
transactivation assay) 

<400> 53 

gcgtcgacga tggcaggagg aggaccaact 30 



<210> 54 
<211> 31 
<212> DNA 

<213> reverse primer for bnKCP1 

<400> 54 
gcgcggccgc ctcatcttct tcttcttcct c 31 

<210> 55 
<211> 31 
<212> DNA 

<213> reverse primer for bnKCP1 

<400> 55 
gcgcggccgc atgaacaggc aaaagaggca t 31 

<210> 56 
<211> 31 
<212> DNA 

<213> reverse primer for bnKCP1 

<400> 56 
gcgcggccgc ctcctcctca tcattgtctt c 31 

<210> 57 
<211> 45 

<212> DNA 

<213> forward primer for bnKCP1G188 

<400> 57 

gatgttcttg cgaggagacc aggattcaag aacagagcat tgaag 45 

<210> 58 

<211> 45 

<212> DNA 

<213> reverse primer for bnKCP1G188 

45 

<210> 59 

<211> 30 

<212> DNA 

<213> forward primer for bnKCP1 81-215 

<400> 59 

gcgtcgacgc tagggttggc ttcattgaga 30 

<210> 60 

<211> 29 

<212> DNA 

<213> forward primer for entire coding region of bnKCP1 

<400> 60 

gcgaattcat ggcaggagga ggaccaact 29 

<210> 61 

<211> 29 

<212> DNA 

<213> reverse primer for entire coding region of bnKCP1 

<400> 61 

cggagctcct catcttcttc ttcttcttc 29 

<210> 62 

<211> 7 

<212> PRT 

<213> pat7 NLS (PLNKKRR) 

<400> 62 

Pro Leu Asn Lys Lys Arg Arg 
1 5 

<210> 63 
<211> 143 
<212> PRT 

<213> aa seq of ROSR (ROS receptor) 
<400> 63 

Met Thr Asp Met Ala Thr Gly Asn Ala Pro Glu Leu Leu Val Glu Leu 
1 5 10 15 

Thr Ala Asp Ile Val Ala Ala Tyr Val Ser Asn His Val Val Pro Val 
20 25 30 

Ser Asp Leu Ala Asn Leu Ile Ser Asp Val His Ser Ala Leu Ser Asn 
35 40 45 

Thr Ser Val Pro Gln Pro Ala Ala Ala Val Val Glu Lys Gln Lys Pro 
50 55 60 

Ala Val Ser Val Arg Lys Ser Val Gln Asp Glu Gln Ile Thr Cys Leu 
65 70 75 80 

Glu Cys Gly Gly Asn Phe Lys Ser Leu Lys Arg His Leu Met Thr His 
85 90 95 

His Ser Leu Ser Pro Glu Glu Tyr Arg Glu Lys Trp Asp Leu Pro Thr 
100 105 110 

Asp Tyr Pro Met Val Ala Pro Ala Tyr Ala Glu Ala Arg Ser Arg Leu 
115 120 125 

Ala Lys Glu Met Gly Leu Gly Gln Arg Arg Lys Arg Gly Arg Gly 
130 135 140 

<210> 64 

<211> 142 

<212> PRT 

<213> aa seq of ROSAR (ROS receptor) 



<400> 64 

Met Thr Glu Thr Ala Tyr Gly Asn Ala Gln Asp Leu Leu Val Glu Leu 
1 5 10 15 

Thr Ala Asp Ile Val Ala Ala Tyr Val Ser Asn His Val Val Pro Val 
20 25 30 

Thr Glu Leu Pro Gly Leu Ile Ser Asp Val His Thr Ala Leu Ser Gly 
35 40 45 

Thr Ser Ala Pro Ala Ser Val Ala Val Asn Val Glu Lys Gln Lys Pro 
50 55 60 

Ala Val Ser Val Arg Lys Ser Val Gln Asp Asp His Ile Val Cys Leu 
65 70 75 80 

Glu Cys Gly Gly Ser Phe Lys Ser Leu Lys Arg His Leu Thr Thr His 
85 90 95 

His Ser Met Thr Pro Glu Glu Tyr Arg Glu Lys Trp Asp Leu Gln Val 
100 105 110 

Asp Tyr Pro Met Val Ala Pro Ala Tyr Ala Glu Ala Arg Ser Arg Leu 
115 120 125 

Ala Lys Glu Met Gly Leu Gly Gln Arg Arg Lys Ala Asn Arg 
130 135 140 

<210> 65 
<211> 143 
<212> PRT 

<213> aa seq of MucR (ROS receptor) 



<400> 65 

Met Thr Glu Thr Ser Leu Gly Thr Ser Asn Glu Leu Leu Val Glu Leu 
1 5 10 15 

Thr Ala Glu Ile Val Ala Ala Tyr Val Ser Asn His Val Val Pro Val 
20 25 30 

Ala Glu Leu Pro Thr Leu Ile Ala Asp Val His Ser Ala Leu Asn Asn 
35 40 45 

Thr Thr Ala Pro Ala Pro Val Val Val Pro Val Glu Lys Pro Lys Pro 
50 55 60 

Ala Val Ser Val Arg Lys Ser Val Gln Asp Asp Gln Ile Thr Cys Leu 
65 70 75 80 

Glu Cys Gly Gly Thr Phe Lys Ser Leu Lys Arg His Leu Met Thr His 
85 90 95 

His Asn Leu Ser Pro Glu Glu Tyr Arg Asp Lys Trp Asp Leu Pro Ala 
100 105 110 

Asp Tyr Pro Met Val Ala Pro Ala Tyr Ala Glu Ala Arg Ser Arg Leu 
115 120 125 

Ala Lys Glu Met Gly Leu Gly Gln Arg Arg Lys Arg Arg Gly Lys 
130 135 140 

<210> 66 

<211> 10 

<212> DNA 

<213> VirC/VirD DNA binding site seq (1) 
<400> 66 
tatatttcaa 

<210> 67 

<211> 10 

<212> DNA 

<213> VirC/VirD DNA binding site seq (2) 

<400> 67 
tatattacaa 

<210> 68 

<211> 10 

<212> DNA 

<213> ipt DNA binding site seq (1) 

<400> 68 
tataattaaa 

<210> 69 

<211> 10 

<212> DNA 

<213> ipt DNA binding site seq (2) 

<400> 69 
aatgcgacag 

<210> 70 

<211> 10 

<212> DNA 

<213> consensus DNA binding site seq 

<400> 70 
tatahttcaa 



<210> 71 
<211> 215 
<212> PRT 
<213> bnKCP aa seq 

<400> 71 

Met Ala Gly Gly Gly Pro Thr Phe Ser Ile Glu Leu Ser Ala Tyr Gly 
1 5 10 15 

Ser Asp Leu Pro Thr Asp Lys Ala Ser Gly Asp Ile Pro Asn Glu Glu 
20 25 30 

Gly Ser Gly Leu Ser Arg Val Gly Ser Gly Ile Trp Ser Gly Arg Thr 
35 40 45 

Val Asp Tyr Ser Ser Glu Ser Ser Ser Ser Ile Gly Thr Pro Gly Asp 
50 55 60 

Ser Glu Glu Glu Asp Glu Glu Ser Glu Glu Asp Asn Asp Glu Glu Glu 
65 70 75 80 

Leu Gly Leu Ala Ser Leu Arg Ser Leu Glu Asp Ser Leu Pro Ser Lys 
85 90 95 

Gly Leu Ser Ser His Tyr Lys Gly Lys Ser Lys Ser Phe Gly Asn Leu 
100 105 110 

Gly Glu Ile Gly Ser Val Lys Glu Val Pro Lys Gln Glu Asn Pro Leu 
115 120 125 

Asn Lys Lys Arg Arg Leu Gln Ile Tyr Asn Lys Leu Ala Arg Lys Ser 
130 135 140 

Phe Tyr Ser Trp Gln Asn Pro Lys Ser Met Pro Leu Leu Pro Val His 
145 150 155 160 

Glu Asp Asn Asp Asp Glu Glu Gly Asp Asp Gly Asp Leu Ser Asp Glu 
165 170 175 

Glu Arg Gly Gly Asp Val Leu Ala Arg Arg Pro Ser Phe Lys Asn Arg 
180 185 190 

Ala Leu Lys Ser Met Ser Cys Phe Ala Leu Ser Asp Leu Gln Glu Glu 
195 200 205 

Glu Glu Glu Glu Glu Asp Glu 
210 215 

<210> 72 

<211> 240 

<212> PRT 

<213> atKCP aa seq 
<400> 72 

Met Glu Leu Met Ala Lys Pro Thr Phe Ser Ile Glu Val Ser Gln Tyr 
1 5 10 15 

Gly Thr Thr Asp Leu Pro Ala Thr Glu Lys Ala Ser Ser Ser Ser Ser 
20 25 30 

Ser Phe Glu Thr Thr Asn Glu Glu Gly Val Glu Glu Ser Gly Leu Ser 
35 40 45 

Arg Ile Trp Ser Gly Gln Thr Ala Asp Tyr Ser Ser Asp Ser Ser Ser 
50 55 60 

Ile Gly Thr Pro Gly Asp Ser Glu Glu Asp Glu Glu Glu Ser Glu Asn 
65 70 75 80 

Glu Asn Asp Asp Val Ser Ser Lys Glu Leu Gly Leu Arg Gly Leu Ala 
85 90 95 

Ser Met Ser Ser Leu Glu Asp Ser Leu Pro Ser Lys Arg Gly Leu Ser 
100 105 110 

A.„ His Tyr Lys Gly Lys Ser Lys Ser Phe Gly Asn Leu Gly Glu "« 
115 120 125 

Gly Ser Val Lys Glu Val Ala Lys Gln Glu Asn Pro Leu Asn Lys Arg 
130 135 140 

Arg Arg Leu Gln Ile Cys Asn Lys Leu Ala Arg Lys Ser Phe Tyr Ser 
145 150 155 160 

Trp Gln Asn Pro Lys Ser Met Pro Leu Leu Pro Val Asn Glu Asp Glu 
165 170 175 

Asp Asp Asp Asp Glu Asp Asp Asp Glu Glu Asp Leu Lys Ser Gly Phe 
180 185 190 

Asp Glu AS. Lys Ser Ser Ser Asp Glu Glu Gly Val Lys Lys Val Val 
195 200 205 

Val Arg Lys Gly Ser Phe Lys Asn Arg Ala Tyr Lys Ser Arg Ser Cys 
210 215 220 

Phe Ala Leu Ser Asp Leu Ile Glu Glu Glu Asp Asp Asp Asp Asp Gln 
225 230 235 240 

<210> 73 

<211> 214 

<212> PRT 

<213> atKCL1 aa seq 

<400> 73 

Met Glu Val Leu Val Gly Ser Thr Phe Arg Asp Arg Ser Ser Val Thr 
1 5 10 15 

Thr His Asp Gln Ala Val Pro Ala Ser Leu Ser Ser Arg Ile Gly Leu 
20 25 30 

Arg Arg Cys Gly Arg Ser Pro Pro Pro Glu Ser Ser Ser Ser Val Gly 
35 40 45 

Glu Thr Ser Glu Asn Glu Glu Asp Glu Asp Asp Ala Val Ser Ser Ser 
50 55 60 

Gln Gly Arg Trp Leu Asn Ser Phe Ser Ser Ser Leu Glu Asp Ser Leu 
65 70 75 80 

Pro Ile Lys Arg Gly Leu Ser Asn His Tyr Ile Gly Lys Ser Lys Ser 
85 90 95 

Phe Gly Asn Leu Met Glu Ala Ser Asn Thr Asn Asp Leu Val Lys Val 
100 105 110 

Glu Ser Pro Leu Asn Lys Arg Arg Arg Leu Leu Ile Ala Asn Lys Leu 
115 120 125 

Arg Arg Arg Ser Ser Leu Ser Ser Phe Ser Ile Tyr Thr Lys Ile Asn 
130 135 140 

Pro Asn Ser Met Pro Leu Leu Ala Leu Gln Glu Ser Asp Asn Glu Asp 
145 150 155 160 

His Lys Leu Asn Asp Asp Asp Asp Asp Asp Asp Ser Ser Ser Asp Asp 
165 170 175 

Glu Thr Ser Lys Leu Lys Glu Lys Arg Met Lys Met Thr Asn His Arg 
180 185 190 

Asp Phe Met Val Pro Gln Thr Lys Ser Cys Phe Ser Leu Thr Ser Phe 
195 200 205 

Gln Asp Asp Asp Asp Arg 
210 



<210> 74 

<211> 221 

<212> PRT 

<213> atKCL2 aa seq 

<400> 74 

Met Val Gly Ser Ser Phe Gly Ile Gly Met Ala Ala Tyr Val Arg Asp 
1 5 10 15 

His Arg Gly Val Ser Ala Gln Asp Lys Ala Val Gln Thr Ala Leu Phe 
20 25 30 

Leu Ala Asp Glu Ser Gly Arg Gly Gly Ser Gln Ile Gly Ile Gly Leu 
35 40 45 

Arg Met Ser Asn Asn Asn Asn Lys Ser Pro Glu Glu Ser Ser Asp Ser 
50 55 60 
Ser Ala 

x 


Asp Ser 


Glu 
5 


Val He Asp 


Ser 


His 
10 


Lys 


Arg Arg 


Ser Arg 


Arg Pro 
20 


Ser 


Tyr Arg Lys 

- 


He 
25 


Leu 


Asn 


Glu Leu 


Val Pro 


Gly He 
35 


Pro 


Lys He Glu 
40 


Glu 


Glu 






<210> 80 

<211> 42 

<212> PRT 

<213> cCREM aa seq 













15 



30 



<400> 80 

Ala Glu Ser Glu Gly Val Ile Asp Ser His Lys Arg Arg Glu Ile Leu 
1 5 10 15 

Ser Arg Arg Pro Ser Tyr Arg Lys Ile Leu Asn Glu Leu Ser Ser Asp 
20 25 30 

Val Pro Gly Val Pro Lys Ile Glu Glu Glu 
35 40 


<210> 81 

<211> 461 

<212> PRT 

<213> aa 



<400> 81 

Met Lys Leu Gln Ala Ser Ser Pro Gln Asp Asn Gln Pro Ser Asn Thr 
1 5 10 15 

Thr Asn Asn Ser Thr Asp Ser Asn His Leu Ser Met Asp Glu His Ala 
20 25 30 

Met Arg Ser Met Asp Trp Asp Ser Ile Met Lys Glu Leu Glu Val Asp 
35 40 45 

Asp Asp Ser Ala Pro Tyr Gln Leu Gln Pro Ser Ser Phe Asn Leu Pro 
50 55 60 

Val Phe Pro Asp Ile Asp Ser Ser Asp Val Tyr Pro Gly Pro Asn Gln 
65 70 75 80 

Ile Thr Gly Tyr Gly Phe Asn Ser Leu Asp Ser Val Asp Asn Gly Gly 
85 90 95 

Phe Asp Tyr Ile Glu Asp Leu Ile Arg Val Val Asp Cys Ile Glu Ser 
100 105 110 

Asp Glu Leu His Leu Ala His Val Val Leu Ser Gln Leu Asn Gln Arg 
115 120 125 

Leu Gln Thr Ser Ala Gly Arg Pro Leu Gln Arg Ala Ala Phe Tyr Phe 
130 135 140 

Lys Glu Ala Leu Gly Ser Leu Leu Thr Gly Thr Asn Arg Asn Gln Leu 
145 150 155 160 

Phe Ser Trp Ser Asp Ile Val Gln Lys Ile Arg Ala Ile Lys Glu Phe 
165 170 175 

Ser Gly Ile Ser Pro Ile Pro Leu Phe Ser His Phe Thr Ala Asn Gln 
180 185 190 

Ala Ile Leu Asp Ser Leu Ser Ser Gln Ser Ser Ser Pro Phe Val His 
195 200 205 

Val Val Asp Phe Glu Ile Gly Phe Gly Gly Gln Tyr Ala Ser Leu Met 
210 215 220 

Arg Glu Ile Ala Glu Lys Ser Ala Asn Gly Gly Phe Leu Arg Val Thr 
225 230 235 240 

Ala Val Val Ala Glu Asp Cys Ala Val Glu Thr Arg Leu Val Lys Glu 
245 250 255 

Asn Leu Thr Gln Phe Ala Ala Glu Met Lys Ile Arg Phe Gln Ile Glu 
260 265 270 

Phe Val Leu Met Lys Thr Phe Glu Ile Leu Ser Phe Lys Ala Ile Arg 
275 280 285 

Phe Val Asp Gly Glu Arg Thr Val Val Leu Ile Ser Pro Ala Ile Phe 
290 295 300 

Arg Arg Val Ile Gly Ile Ala Glu Phe Val Asn Asn Leu Gly Arg Val 
305 310 315 320 

Ser Pro Asn Val Val Val Phe Val Asp Ser Glu Gly Cys Thr Glu Thr 
325 330 335 

Ala Gly Ser Gly Ser Phe Arg Arg Glu Phe Val Ser Ala Phe Glu Phe 
340 345 350 

Tyr Thr Met Val Leu Glu Ser Leu Asp Ala Ala Ala Pro Pro Gly Asp 
355 360 365 

Leu Val Lys Lys Ile Val Glu Thr Phe Leu Leu Arg Pro Lys Ile Ser 
370 375 380 

Ala Ala Val Glu Thr Ala Ala Asn Arg Arg Ser Ala Gly Gln Met Thr 
385 390 395 400 

Trp Arg Glu Met Leu Cys Ala Ala Gly Met Arg Pro Val Gln Leu Ser 
405 410 415 

Gln Phe Ala Asp Phe Gln Ala Glu Cys Leu Leu Glu Lys Ala Gln Val 
420 425 430 

Arg Gly Phe His Val Ala Lys Arg Gln Gly Glu Leu Val Leu Cys Trp 
435 440 445 

His Gly Arg Ala Leu Val Ala Thr Ser Ala Trp Arg Phe 
450 455 460 

<210> 82 

<211> 486 

<212> PRT 

<213> aa seq of atSCL15 



<400> 82 

Met Lys Ile Pro Ala Ser Ser Pro Gln Asp Thr Thr Asn Asn Asn Asn 
1 5 10 15 

Asn Thr Asn Ser Thr Asp Ser Asn His Leu Ser Met Asp Glu His Val 
20 25 30 

Met Arg Ser Met Asp Trp Asp Ser Ile Met Lys Glu Leu Glu Leu Asp 
35 40 45 

Asp Asp Ser Ala Pro Asn Ser Leu Lys Thr Gly Phe Thr Thr Thr Thr 
50 55 60 

Thr Asp Ser Thr Ile Leu Pro Leu Tyr Ala Val Asp Ser Asn Leu Pro 
65 70 75 80 

Gly Phe Pro Asp Gln Ile Gln Pro Ser Asp Phe Glu Ser Ser Ser Asp 
85 90 95 

Val Tyr Pro Gly Gln Asn Gln Thr Thr Gly Tyr Gly Phe Asn Ser Leu 
100 105 110 

Asp Ser Val Asp Asn Gly Gly Phe Asp Phe Ile Glu Asp Leu Ile Arg 
115 120 125 

Val Val Asp Cys Val Glu Ser Asp Glu Leu Gln Leu Ala Gln Val Val 
130 135 140 

Leu Ser Arg Leu Asn Gln Arg Leu Arg Ser Pro Ala Gly Arg Pro Leu 
145 150 155 160 

Gln Arg Ala Ala Phe Tyr Phe Lys Glu Ala Leu Gly Ser Phe Leu Thr 
165 170 175 

Gly Ser Asn Arg Asn Pro Ile Arg Leu Ser Ser Trp Ser Glu Ile Val 
180 185 190 

Gln Arg Ile Arg Ala Ile Lys Glu Tyr Ser Gly Ile Ser Pro Ile Pro 
195 200 205 

Leu Phe Ser His Phe Thr Ala Asn Gln Ala Ile Leu Asp Ser Leu Ser 
210 215 220 

Ser Gln Ser Ser Ser Pro Phe Val His Val Val Asp Phe Glu Ile Gly 
225 230 235 240 

Phe Gly Gly Gln Tyr Ala Ser Leu Met Arg Glu Ile Thr Glu Lys Ser 
245 250 255 

Val Ser Gly Gly Phe Leu Arg Val Thr Ala Val Val Ala Glu Glu Cys 
260 265 270 

Ala Val Glu Thr Arg Leu Val Lys Glu Asn Leu Thr Gln Phe Ala Ala 
275 280 285 

Glu Met Lys Ile Arg Phe Gln Ile Glu Phe Val Leu Met Lys Thr Phe 
290 295 300 

Glu Met Leu Ser Phe Lys Ala Ile Arg Phe Val Glu Gly Glu Arg Thr 
305 310 315 320 

Val Val Leu Ile Ser Pro Ala Ile Phe Arg Arg Leu Ser Gly Ile Thr 
325 330 335 

Asp Phe Val Asn Asn Leu Arg Arg Val Ser Pro Lys Val Val Val Phe 
340 345 350 

Val Asp Ser Glu Gly Trp Thr Glu Ile Ala Gly Ser Gly Ser Phe Arg 
355 360 365 

Arg Glu Phe Val Ser Ala Leu Glu Phe Tyr Thr Met Val Leu Glu Ser 
370 375 380 

Leu Asp Ala Ala Ala Pro Pro Gly Asp Leu Val Lys Lys Ile Val Glu 
385 390 395 400 

Ala Phe Val Leu Arg Pro Lys Ile Ser Ala Ala Val Glu Thr Ala Ala 
405 410 415 

Asp Arg Arg His Thr Gly Glu Met Thr Trp Arg Glu Ala Phe Cys Ala 
420 425 430 

Ala Gly Met Arg Pro Ile Gln Gln Ser Gln Phe Ala Asp Phe Gln Ala 
435 440 445 

Glu Cys Leu Leu Glu Lys Ala Gln Val Arg Gly Phe His Val Ala Lys 
450 455 460 

Arg Gln Gly Glu Leu Val Leu Cys Trp His Gly Arg Ala Leu Val Ala 
465 470 475 480 

Thr Ser Ala Trp Arg Phe 
485 



<210> 83 

<211> 536 

<212> PRT 

<213> aa i 



<400> 83 

Met Lys Val Pro Phe Ser Thr Asn Asp Asn Val Ser Ser Lys Pro Leu 
1 5 10 15 

Val Asn Ser Asn Asn Ser Phe Thr Phe Pro Ala Ala Thr Asn Gly Ser 
20 25 30 

Asn Leu Cys Tyr Glu Pro Lys Ser Val Leu Glu Leu Arg Arg Ser Pro 
35 40 45 

Ser Pro Ile Val Asp Lys Gln Ile Ile Thr Thr Asn Pro Asp Leu Ser 
50 55 60 

Ala Leu Cys Gly Gly Glu Asp Pro Leu Gln Leu Gly Asp His Val Leu 
65 70 75 80 

Ser Asn Phe Glu Asp Trp Asp Ser Leu Met Arg Glu Leu Gly Leu His 
85 90 95 

Asp Asp Ser Ala Ser Leu Ser Lys Thr Asn Pro Leu Thr His Ser Glu 
100 105 110 

Ser Leu Thr Gln Phe His Asn Leu Ser Glu Phe Ser Ala Glu Ser Asn 
115 120 125 

Gln Phe Pro Ser Pro Asp Phe Ser Phe Ser Asp Thr Asn Phe Pro Gln 
130 135 140 

Gln Phe Pro Thr Val Asn Gln Ala Ser Phe Ile Asn Ala Leu Asp Leu 
145 150 155 160 

Ser Gly Asp Ile His Gln Asn Trp Ser Val Gly Phe Asp Tyr Val Asp 
165 170 175 

Glu Leu Ile Arg Phe Ala Glu Cys Phe Glu Thr Asn Ala Phe Gln Leu 
180 185 190 

Ala His Val Ile Leu Ala Arg Leu Asn Gln Arg Leu Arg Ser Ala Ala 
195 200 205 

Gly Lys Pro Leu Gln Arg Ala Ala Phe Tyr Phe Lys Glu Ala Leu Gln 
210 215 220 

Ala Gln Leu Ala Gly Ser Ala Arg Gln Thr Arg Ser Ser Ser Ser Ser 
225 230 235 240 

Asp Val Ile Gln Thr Ile Lys Ser Tyr Lys Ile Leu Ser Asn Ile Ser 
245 250 255 

Pro Ile Pro Met Phe Ser Ser Phe Thr Ala Asn Gln Ala Val Leu Glu 
260 265 270 

Ala Val Asp Gly Ser Met Leu Val His Val Ile Asp Phe Asp Ile Gly 
275 280 285 

Leu Gly Gly His Trp Ala Ser Phe Met Lys Glu Leu Ala Asp Lys Ala 
290 295 300 

Glu Cys Arg Lys Ala Asn Ala Pro Ile Leu Arg Ile Thr Ala Leu Val 
305 310 315 320 

Pro Glu Glu Tyr Ala Val Glu Ser Arg Leu Ile Arg Glu Asn Leu Thr 
325 330 335 

Gln Phe Ala Arg Glu Leu Asn Ile Gly Phe Glu Ile Asp Phe Val Leu 
340 345 350 

Ile Arg Thr Phe Glu Leu Leu Ser Phe Lys Ala Ile Lys Phe Met Glu 
355 360 365 

Gly Glu Lys Thr Ala Val Leu Leu Ser Pro Ala Ile Phe Arg Arg Val 
370 375 380 

Gly Ser Gly Phe Val Asn Glu Leu Arg Arg Ile Ser Pro Asn Val Val 
385 390 395 400 

Val His Val Asp Ser Glu Gly Leu Met Gly Tyr Gly Ala Met Ser Phe 
405 410 415 

Arg Gln Thr Val Ile Asp Gly Leu Glu Phe Tyr Ser Thr Leu Leu Glu 
420 425 430 

Ser Leu Glu Ala Ala Asn Ile Gly Gly Gly Asn Cys Gly Asp Trp Met 
435 440 445 

Arg Lys Ile Glu Asn Phe Val Leu Phe Pro Lys Ile Val Asp Met Ile 
450 455 460 

Gly Ala Val Gly Arg Arg Gly Gly Gly Gly Ser Trp Arg Asp Ala Met 
465 470 475 480 

Val Asp Ala Gly Phe Arg Pro Val Gly Leu Ser Gln Phe Ala Asp Phe 
485 490 495 

Gln Ala Asp Cys Leu Leu Gly Arg Val Gln Val Arg Gly Phe His Val 
500 505 510 

Ala Lys Arg Gln Ala Glu Met Leu Leu Cys Trp His Asp Arg Ala Leu 
515 520 525 

Val Ala Thr Ser Ala Trp Arg Cys 
530 535 

<210> 84 

<211> 26 

<212> DNA 

<213> BnSCL1 sense primer 

<400> 84 
gatggacgaa catgccatgc gttcca 26 

<210> 85 

<211> 21 

<212> DNA 

<213> BnSCL1 anti-sense primer 

<400> 85 
cgctcggatc ttctgaacaa t 21 

<210> 86 
<211> 21 

<212> DNA 

<213> BnIAA1 sense primer 

<400> 86 

ccacgcgtcc ggtacgatga t 21 

<210> 87 

<211> 22 

<212> DNA 

<213> BnIAA1 anti-sense primer 



<400> 87 

gaagttgaga aatggtttat ga 22 

<210> 88 

<211> 21 

<212> DNA 

<213> BnIAA12 sense primer 



<400> 88 

acgctggtgc ttctcctcct c 21 

<210> 89 

<211> 24 

<212> DNA 

<213> BnIAA12 anti-sense primer 



<400> 89 

aaaacccatt agaagaacca agaa 24 

<210> 90 
<211> 33 
<212> DNA 

<213> forward primer for BnSCL1, BnSCL1 1-358, BnSCL1 1-261, BnSCL1 1-217 
and BnSCL1 1-145 for pET-28b vector 
<400> 90 

gcaagcttat ggacgaacat gccatgcgtt cca 

<210> 91 
<211> 29 
<212> DNA 

<213> reverse primer for BnSCL1 for pET-28b vector 
<400> 91 

cgctcgagaa agcgccacgc tgacgtggc 

<210> 92 
<211> 29 
<212> DNA 

<213> reverse primer for BnSCL1 1-358 for pET-28b vector 
<400> 92 

cgctcgagcg cggagatctt cggacgtaa 

<210> 93 
<211> 29 
<212> DNA 

<213> reverse primer for BnSCL1 1-261 for pET-28b vector 
<400> 93 

cgctcgagcc taatcgcctt gaaagataa 

<210> 94 
<211> 29 
<212> DNA 

<213> reverse primer for BnSCL1 1-217 for pET-28b vector 
<400> 94 

cgctcgagcg ccacaaccgc cgtgactct 
<210> 104 

<211> 5 

<212> PRT 

<213> aa seq of LXXLL motif (148LGSLL152) 

<400> 104 

Leu Gly Ser Leu Leu 
1 5 